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We update the measurement of the top production cross section using 
the CDF detector at the Fermilab Tevatron. This measurement uses it decays 
to the final states e + z^+jets and n + v+jets. We search for b quarks from t 
decays via secondary- vertex identification or the identification of semileptonic 
decays of the b and cascade c quarks. The background to the ti production 
is determined primarily through a Monte Carlo simulation. However, we cal- 
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ibrate the simulation and evaluate its uncertainty using several independent 
data samples. For a top mass of 175 GeV/c 2 , we measure o t i = 5.1 ± 1.5 
pb and a t i = 9.2 ± 4.3 pb using the secondary vertex and the lepton tagging 
algorithms, respectively. Finally, we combine these results with those from 
other ti decay channels and obtain o t i = 6.51*4 pb. 

PACS number(s): 14.65.Ha, 13.85.Ni, 13.85.Qk 

I. INTRODUCTION 

In the Standard Model (SM) the top quark completes the third fermion generation. 
The measurement of the top-quark pair production cross section <j t i is of interest as a test 
of quantum chromodynamics (QCD) predictions. Aside from the obvious observation that a 
deviation from these predictions could be indicative of new physics, recent QCD calculations 
predict a t i with an uncertainty smaller than 15% [[[J which motivates measurements of 
comparable precision. 

In pp collisions at yfs = 1.8 TeV top quarks are pair produced through qq annihilation 
(~ 90%) or gluon fusion (~ 10%). In the SM framework the top quark decays into a W 
boson and a h quark. When one of the W bosons decays to an electron or a muon, the final 
state includes a charged lepton with high transverse momentum (pr), a large transverse 
energy imbalance from the undetected neutrino, referred to as JZl T , and four jets from the 
hadronized quarks. However, due to gluon radiation or jet merging, the number of detected 
jets may vary. We measure a t i using this final state, referred to in this paper as W+ jets 
and selected from the data (105.1 pb -1 ) collected by CDF in the 1992 — 1995 collider run. 

The same data set has been used in the previous CDF measurement of o t i 0. This 
paper revises that measurement and expands on many of the analysis details. The selection 
of the W+ jet sample follows the guide-lines used in all previous CDF measurements of the 
top mass and production cross section PQ. 
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As done in previous analyses, we employ two techniques to enhance the relative fraction 
of events coming from top quark decays with respect to the background. The first method 
searches a jet for the presence of a secondary vertex reconstructed using the silicon vertex 
detector (SVX) and displaced from the primary event vertex due to the long 6-quark lifetime 
(SECVTX tag). The second method searches a jet for the presence of a lepton, indicative 
of a semileptonic 6-decay. Since these leptons typically have low momentum compared to 
the lepton from the W decay, they are referred to as soft lepton tags (SLT). 

In this analysis we use the same SECVTX and SLT algorithms as in Ref. ||. Differently 
from Refs. |||||5|], we search jets and not events for soft lepton tags; this approach provides 
the rates of backgrounds for the top mass measurement which uses the same technique [||]. 

As a cross-check, we take advantage of a third algorithm, jet-probability, which uses 
the impact parameter significance of all tracks in a jet to derive a probability that the jet 
originates from the primary event vertex ||. Jets with small probability of having zero 
lifetime are considered jet-probability tags (JPB). The value of the jet-probability threshold 
is tuned to have higher tagging efficiency than SECVTX in jets originating from c quarks 
and a higher rate of fake tags in jets without heavy flavor. Although similar to SECVTX 
in that it is based on track information from displaced vertices, it is used here to provide 
important cross-checks of the background determination and measured cross section. 

The method used to measure a t i is outlined in Ref. || and has since been improved. As 
summarized in Ref. [[J , the method relies on the calculation of all the background contribu- 
tions to the tagged W+jet sample. The excess over background of the W + 3, 4 jet events 
with at least one tag is attributed to ti production and used to derive <j t t- 

The major sources of background are the processes pp — > Wg with g — > bb, cc (referred 
to as gluon splitting) and pp — > Wc. The second largest source of background is mistags 
(tags in jets which do not contain heavy flavor). Smaller contributions come from other 
processes like noia-W production, single top production, WW, WZ, ZZ and Z — > tt. 

The method used to measure a t i relies on the correct calibration of the Monte Carlo 
generators and the detector simulation. Simulated events are produced with the Herwig |7j 
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or Pythia || Monte Carlo generators. Hadrons with heavy flavor (b and c) are decayed using 
the CLEO Monte Carlo (QQ) 0. All other particles are decayed, when appropriate, by 
the CDF detector simulation (QFL) which uses its own lifetime table for b and c-hadrons. 
QFL simulates the interaction of all particles in the final state with the CDF detector; the 
detector response is based on parametrizations that are functions of the particle kinematics 
and have been derived using the data. 

This paper describes the work done to understand and improve the calibrations used in 
the method to calculate the background to tt events using independent data samples and 
the corresponding simulations. This work was primarily focussed on the components with 
the largest influence on the determination of a t f, mistags, the efficiencies of the tagging 
algorithms, and the fraction of W+ jet direct production which contains heavy flavor. We 
summarize here the relevant conclusions: 

• we find that the SECVTX tagging efficiency for 6-quark jets (6-jets) is (25±13)% higher 
in data control samples than it is in the Monte Carlo simulation of the same processes. 
Therefore we conclude that the 6-jet tagging rate in Refs. |2||| is underestimated by this 
factor. This data-to-simulation discrepancy is largely due to errors in the simulation 
that were found a posteriori. Instead of remaking the large Monte Carlo samples used 
in this paper we chose to rescale the simulated 6-quark tagging rate by the factor of 
1.25±0.13. 

• we find that the rate of SECVTX mistags in jets without heavy flavor is (50±5)% 
smaller than what we estimated in Refs. @J|. 

• we find the fraction of g — > bb and g — *> cc in the W+ jet direct production evaluated 
with Herwig needs to be increased by (39 ± 19)% and (35 ± 36)%, respectively. 

These last two effects tend to cancel, leaving the net background to top approximately 
unchanged from our previous results in Refs. In the W + 3,4 jets sample we observe 

29 events with one or more SECVTX tags and 25 events with one or more SLT tags. The 
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expected backgrounds are 8.0±1.0 and 13.2±1.2 events, respectively. The excess of SECVTX 
tags yields the cross section a t j = 5.08 ± 1.54 pb and the excess of SLT tags yields o# = 
9.18 ± 4.26 pb for a top quark mass of 175 GeV/c 2 . 

Following a brief description of the CDF detector in Section 0, Section [TTI| describes 
the triggers and the reconstruction of leptons, jets and the missing transverse energy. The 
selection of W+ jet events is detailed in Section [TV], along with the selection of the Z+ 
jet sample, which will be used to check the background calculation. The selection of other 
data samples used to calibrate the event generators and the detector simulation is described 
in Section |V|. Sections |V| and [VI I| are dedicated to the Monte Carlo generators and the 
CDF detector simulation (QFL), respectively. Section [VII]] describes the algorithms used 
for the identification of the heavy flavor. The efficiency of those algorithms is calculated 



in Section [IX|, which also includes numerous checks of the result and an evaluation of its 
systematic uncertainty. The new method for evaluating mistags and the determination of its 
accuracy is described in Section [X|. Section |X1] details the calibration of the g — > bb and g — ► 
cc cross sections predicted by the Herwig generator. Section |X1J| describes the calculation of 
the backgrounds to the tt production. In Section KTTIl we check the background calculation 
using the Z+ jet sample. Additional checks of the background calculation are described in 
Section [XIV[ Finally, a t t is derived in Section [XV| . In Section [XVI| , we combine the present 
results with previous CDF measurements of o t i that are derived using different data sets. 



We conclude in Section XVII. 



II. THE CDF DETECTOR 

CDF is a general purpose detector with azimuthal and forward-backward symmetry 
designed to study pp interactions. The CDF coordinate system has the z-axis pointing 
along the proton momentum and the x-axis located in the horizontal plane of the Tevatron 
storage ring pointing radially outward so that the |/-axis points up. The coordinates r-<p 
are the standard cylindrical coordinates. A complete description of CDF can be found in 
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Refs. p,|TO||. The detector components most relevant to this analysis are summarized below. 

A superconducting solenoid of length 4.8 m and radius 1.5 m generates a 1.4-T magnetic 
field. The solenoid contains three types of tracking chambers for detecting charged particles 
and measuring their momenta. A four layer silicon microstrip vertex detector (SVX) sur- 
rounds the beryllium beam pipe of radius 1.9 cm. The SVX has an active length of 51 cm; 
the four layers of the SVX are at distances of approximately 2.9, 4.2, 5.5 and 7.9 cm from 
the beamline. Axial micro-strips with 60-/im pitch provide accurate track reconstruction in 
the r-(f) plane transverse to the beam H - Outside the SVX there is a vertex drift chamber 



(VTX) which provides track information up to a radius of 22 cm and for pseudo-rapidities 
\r]\ < 3.5. The VTX measures the z-position of the primary vertex. Both the SVX and the 
VTX are mounted inside the CTC, a 3.2 m long drift chamber with an outer radius of 132 cm 
containing 84 concentric, cylindrical layers of sense wires, which are grouped into 8 alternat- 
ing axial and stereo superlayers. The solenoid is surrounded by sampling calorimeters used 
to measure the electromagnetic and hadronic energy of jets and electrons. The calorimeters 
cover the pseudo-rapidity range \r]\ < 4.2. The calorimeters are segmented in r]-<j) towers 
pointing to the nominal interaction point. There are three separate 7/-regions of calorimeters. 
Each region has an electromagnetic calorimeter [central (CEM), plug (PEM) and forward 
(FEM)] and behind it a hadron calorimeter [CHA, PHA and FHA, respectively]. Located 
six radiation lengths inside the CEM calorimeter, proportional wire chambers (CES) pro- 
vide shower-position measurements in the z and r — <fi view. Proportional chambers (CPR) 
located between the solenoid and the CEM detect early development of electromagnetic 
showers in the solenoid coil. These chambers provide r-0 information only. 

The calorimeter acts as a hadron absorber for the central muon detection system (CMU). 
The CMU consists of four layers of drift chambers located outside the CHA calorimeter. The 
CMU system covers the pseudo-rapidity \r]\ <0.6 and can be reached by muons with > 1-4 
GeV/c. The CMU system is followed by 0.6 m of steel and four additional layers of drift 
chambers (CMP). The system of drift chambers CMX extends the muon detection to |?7| < 
1.0. 
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III. DATA COLLECTION AND IDENTIFICATION OF HIGH P T LEPTONS AND 

JETS 



The last collider run, called Run I, lasted from August of 1992 till July of 1993 (Run 
1A) and from January of 1994 till July of 1995, (Run IB). The data collected during this 
run correspond to an integrated luminosity of 105.1 ±4.0 pb _1 when using the total pp cross 
section value 80.03 ± 2.24 mb jnj. We begin this section with a description of the triggers 
used in this analysis. This is followed by subsections on the reconstruction and identification 
of electrons, muons, jets and neutrinos. 

A. Triggers 

A three-level trigger system is used to select events originating from pp interactions and 
containing electrons, muons, jets or missing transverse energy (JZ! T ). 

The first-level trigger (LI) accepts events based on the identification of energy clusters 
in the calorimeter or track segments in the muon chambers. The LI calorimeter trigger 
requires a single CEM or CHA trigger tower with transverse energy greater than 8 or 12 
GeV, respectively (these thresholds were set at 6 and 8 GeV during Run 1A). The LI muon 
trigger infers the track momentum from the deflection of the track segment in the muon 
chambers due to the magnetic field; it requires a minimum transverse momentum of 6 and 
10 GeV/c in the CMU and CMX chambers, respectively. A minimum energy of 300 MeV 
is required in the hadron calorimeter tower associated with the track segment. 

The second-level trigger (L2) uses the calorimetry information with greater sophistica- 
tion. The L2 trigger is a fastbus based processor with a decision time of approximately 
20 fis. It combines calorimetry towers forming electromagnetic and jet-like clusters. An 
electromagnetic cluster is constructed as a set of contiguous CEM (PEM) towers each with 
E T >7 GeV (4 GeV), including at least one seed tower with E T > 8 GeV (6 GeV). 

The L2 jet clusters are formed starting with a seed tower with Et > 3 GeV and summing 
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all contiguous towers with Et > 1 GeV. A crude estimate of JZ! T is also available at this 
trigger level. The L2 trigger utilizes the list of r — <p tracks provided by the central fast 
tracker (CFT), a hardware processor which uses fast timing information from the CTC as 
input. The events used in this analysis were collected using the L2 high-p^ electron trigger, 
which requires an electromagnetic cluster of transverse energy greater than 16 GeV matched 
by a CFT track with transverse momentum pt > 12 GeV/c. A second trigger requires an 
electromagnetic cluster of Et > 16 GeV and Et > 20 GeV and is used to recover losses due 
to the CFT inefficiency The L2 high-p^ muon trigger requires a CFT track with j>t > 12 
GeV/c pointing within 5° to a LI track segment in the muon detectors. To ensure good 
efficiency, additional L2 muon triggers require only a LI track segment accompanied by at 
least one jet cluster with E T > 15 GeV or Et > 35 GeV. 

The L3 trigger decision is made after the full event reconstruction. Events accepted by 
the L2 trigger are processed by a farm of SGI processors running the full off-line recon- 
struction package. The Level 3 electron trigger requires a CEM cluster with Et > 18 GeV 
and a reconstructed track with pt > 13 GeV/c pointing to it. The ratio of hadronic to 
electromagnetic energy in the cluster is required to be less than 0.125. The Level 3 muon 
trigger requires a match within 10 cm in the r — <p plane between a reconstructed track with 
Pt > 18 GeV/c extrapolated to the radius of the muon detectors and a track segment in the 
muon chambers. 

Trigger efficiencies have been measured directly using events with overlapping triggers. 
The electron trigger efficiency is found to be larger than 99.6% for electrons inside the de- 
tector fiducial volume. Likewise, the muon trigger efficiency is (70 ± 2)%; this includes an 
inefficiency due to the fact that the muon trigger does not cover the entire detector fiducial 
volume. The measured trigger efficiencies have been included in the detector simulation de- 
scribed in Section |V11[ A check of the muon trigger simulation was performed by comparing 



the rate of W — > \xv events in the data to that of a simulation of this process using the 
Herwig generator (see Section |VTD normalized to the same number of W — > ev events. We 
observe agreement between data and simulation within 10%, and this difference is taken as 
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the systematic error on the muon trigger simulation. 

B. Electron reconstruction 

The W+ jet sample is selected requiring electrons reconstructed in the central pseudo- 
rapidity region \r}\ < 1. Stricter cuts, described in detail in Ref. 0, are applied to central 
electron candidates which passed the trigger prerequisites. The following variables are used 
to discriminate against charged hadrons: (1) the ratio of hadronic to electromagnetic energy 
of the cluster, E had /E em ; (2) the ratio of cluster energy to track momentum, E/P\ (3) a 
comparison of the lateral shower profile in the calorimeter cluster with that of test-beam 
electrons, L s h r ] (4) the distance between the extrapolated track-position and the CES mea- 
surement in the r — and z views, Ax and Az; (5) a x 2 comparison of the CES shower 
profile with those of test-beam electrons, X 2 striv \ (6) the interaction vertex position, z ver and 
the distance between the interaction vertex and the reconstructed track in the ^-direction, 
z-vertex match; and (7) the isolation, /, defined as the ratio of the additional transverse 
energy in a cone of radius R = 0.4 around the cluster axis to the transverse energy of the 
electron cluster. The electron selection criteria are listed in Table |I[ 

Fiducial cuts on the electromagnetic shower position, as measured in the CES, are applied 
to insure that the electron candidate is away from the calorimeter boundaries and the energy 
is well measured. 

Electrons from photon conversions are removed using an algorithm based on tracking 
information. Electron tracks close to a companion track with opposite charge are considered 
conversion candidates. The following variables are used to identify and remove photon 
conversions: (1) the difference of the polar angles, 5 cot 6] (2) the distance between the 
two tracks in the r — <ft plane at the radius R con v where the tracks are parallel, A sep ; and 
(3) the conversion radial position, R con v If a companion track is not found, we identify 
conversion candidates using fvrx which is the ratio of the measured to expected number 
of VTX hits associated to the electron candidate. Table [TJ summarizes the criteria used 
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to identify and remove electrons from photon conversions. The efficiency of the conversion 
algorithm is measured with a sample of photon conversions selected using the CPR detector. 
The efficiency of the conversion removal algorithm is 90.7 ± 3.8%. The fraction of electrons 
erroneously removed is estimated using a sample of Z —>■ e + e~ events to be 2.2 ± 0.6% and 
is properly accounted for by the simulation. 

The total primary electron identification efficiency has been measured using a sample of 
Z — > e + e~ decays and is listed in Table |HI[ 

When an electron candidate is found, the calorimeter towers belonging to the electron 
cluster are not used by the jet clustering algorithm. 

C. Muon reconstruction 

Muons are identified in the \rj\ < 1.0 region by extrapolating CTC tracks to the muon 
detectors and matching them to track segments reconstructed in the muon chambers. The 
following variables, described in detail in Ref . , are used to separate muon candidates from 
cosmic rays and from hadrons not contained by the calorimeter: (1) an energy deposition in 
the electromagnetic and hadronic calorimeter characteristic of minimum ionizing particles, 
E em and E had ; (2) the distance of closest approach of the reconstructed track to the beam 
line, d; (3) the ^-vertex match; (4) the matching distance between the extrapolated track 
and the track segment in the muon chamber, Ax = r A0; and (5) the isolation /, the ratio 
of additional transverse energy in a cone of radius R = 0.4 around the track direction to the 
muon transverse momentum. 

The muon selection criteria are listed in Table |IV|. The muon identification efficiency has 
been measured using a sample of Z — > decays and is listed in Table p| . 

Leptons passing the requirements listed in Table | and f[V] are labeled primary leptons. 
As a consequence of the high luminosity of the collider, approximately 50% of the events 
with a primary lepton contain multiple interactions which result in more than one primary 
vertex in the event. The ambiguity is resolved by selecting the vertex associated with the 
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primary lepton track to evaluate jet pseudorapidities and the missing transverse energy. 
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TABLE I. Selection requirements for primary electrons. 



Variable Cut 

E/P < 1.5 

Ehad/Eem < 0.05 

L s hr < 0.2 

|Ax| < 1.5 cm 

\Az\ < 3.0 cm 

Xltriv < 10-0 

z ver < 60.0 cm 

z- vertex match < 5.0 cm 

I < 0.1 



TABLE II. Criteria used to identify electrons from photon conversions. 



Variable Cut 
\A sep \ < 0.3 cm 

|<Scot0| < 0.06 

Rconv -20 < cm and < 50 cm 
fvrx < 20% 



TABLE III. Lepton identification efficiencies, including the isolation requirement, measured 



using a sample of Z — > t 


'I events collected during Run IB. In Run 1A the muon efficiency is 


(7.8 ± 2.8)% lower. 




Lepton type 


Efficiency 


Electrons 


0.81 ± 0.02 


Muons 


0.93 ± 0.03 
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TABLE IV. Selection requirements for primary muons. 



Variable 



Cut 



Ehad 

Eem ~i~ E-*had 
\d\ 

|Aar| 

| Zv er | 

z-vertex match 
I 



< 2 GeV 

< 6 GeV 
> 0.1 GeV 

< 0.3 cm 

< 2.0 cm (CMU) 
< 5.0 cm (CMP,CMX) 

< 60.0 cm 

< 5.0 cm 
< 0.1 
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D. jet reconstruction 



The CDF jet reconstruction algorithm uses a cone of fixed radius in the r\ — space. 
In this analysis we use a cone of radius 0.4 which has been shown to contain approximately 



70% of the jet energy [14]. A detailed description of the jet reconstruction algorithm can be 
found in Ref . [14 . 

The jet energy resolution can be parametrized as <j(Et) / E? ~ 1/ \/Et, where E? is mea- 
sured in GeV. Effects which contribute to the resolution are the lower calorimeter response 
at the boundaries of different towers and of different calorimeter detectors, the loss of low 
momentum particles inside the magnetic field, the energy deposition in towers outside the 
clustering cone, the contribution of the underlying-event and energy losses due to minimum 
ionizing particles or neutrinos present in the jet. Corrections meant to reproduce the average 
jet Et correctly (without improving the energy resolution) are often used [|T^ , [T5|] . The av- 
erage jet energy correction factor ranges from approximately 1.7 to 1.1 as the jet transverse 
energy increases from 15 to 100 GeV. 

Checks of the jet energy corrections have been performed in Ref. || by studying the 
momentum balance in 7+ jet and Z+jet events. The energy imbalance is measured to be 
within 3% of the Z or photon energy. However, the uncertainty in the modeling of the 
large-angle gluon emission results in a 10% systematic uncertainty of the jet energy scale. 



E. Ei T and neutrino reconstruction 

The presence of neutrinos is inferred from transverse energy imbalance in the detector. 
The transverse missing energy is defined as 

$t = — I X! E l T rii I 

i 

where, E l T is the magnitude of the transverse energy contained in each calorimeter tower 
i in the pseudo-rapidity region |?7| < 3.5 and fii is the direction of the tower in the plane 
transverse to the beam direction. When a muon is present in the event, FJ T is calculated as: 
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Et =-|E^ + ^ m ( 1 -^J 



where Ej, is the transverse energy deposited by the muon in the calorimeter and p T is the 
muon transverse momentum. 



IV. SELECTION OF THE W AND Z+ JET SAMPLES 

The W+ jet sample, which contains the ti signal, is selected from the high-pr inclu- 
sive lepton data set by requiring at least one primary electron with Et > 20 GeV or one 
primary muon with pt > 20 GeV/c, JZl T > 20 GeV and at least one jet with uncorrected 
transverse energy Et > 15 GeV and pseudo-rapidity \r]\ < 2. An appreciable fraction of 
these events is due to Z+ jet production. Some Z events can be identifyied and removed 
when the second lepton from the Z decay falls into the detector acceptance. Because W+ 
jet and Z+ jet events have similar production mechanisms, we will use the Z+ jet sample 
to check our evaluation of the backgrounds to ti production. It is also interesting to study 
this sample because events in which one of the two leptons is not identified (unidentified 
Z's) are a background to ti production. The following subsection explains the removal of 
dilepton events. The events surviving dilepton removal constitute the W+ jet sample which 
is described in the last subsection. 



A. Selection of the Z+ jet sample 

Z candidates are selected from the high-p^ lepton data set by requiring a primary 
electron with Et > 20 GeV and by searching for a second lepton of opposite charge which 
satisfies the criteria listed in Table 0. 

Searching for additional electrons we relax the isolation and Ehad/Eem cuts. We also 
search in the PEM and FEM detectors. Additional muons are searched for by relaxing all 
selection cuts defining primary muons. As shown in Table [V], CTC tracks without a match 
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to a track segment in the muon chambers but pointing to a calorimeter tower with a small 
energy deposition are also considered muon candidates. 

Events are flagged as Z candidates if the invariant mass of the lepton pair falls in the 
range 70 < Ma < 110 GeV/c 2 (see Figure [I]). The number of Z candidate events as a 
function of the jet multiplicity is shown in Table [V|. 
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FIG. 1. Invariant mass distributions of electron and muon pairs before and after requiring the 
presence of at least one jet with Et > 15 GeV and |r/| < 2. The shaded area indicates the mass 
window used to select Z candidate events. 
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TABLE V. Selection requirements for loose leptons. 

Variable Cut 

Electrons 

H < 4.2 

E T > 10 GeV 

Ehad/E em < 0.12 

I < 0.15 

Muons with a track segment in the muon chambers 

p T > 10 GeV/c 

|d| < 0.5 cm 

z-vertex match < 10 cm 

|Ax| < 10 cm 

E em < 5 GeV 

E had < 10 GeV 

/ < 0.15 

Muons without a track segment in the muon chambers 

p T > 10 GeV/c 

|d| < 0.5 cm 

^-vertex match < 10 cm 

E em + E had < 10 GeV 

(E em < 2 or Ehad < 6 GeV) 

/ < 0.15 



TABLE VI. Number of Z candidate events as a function of the observed jet multiplicity. 

Jet multiplicity Z — > e + e~ Z — > Total 

ljet 791 357 1148 

2 jets 107 52 159 

3 jets 9 7 16 
> 4 jets 3 1 4 
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B. Dilepton removal 



All events containing a primary lepton and at least one additional lepton selected using 
the criteria listed in Table are removed from the W+ jet sample. These events arise from 
Z — > t~t + , di-bosons, Drell-Yan and tt production. The a t i production cross section using 
dilepton events has been measured in Ref. [IB] and we want to avoid obvious correlations. 

We also remove events containing an isolated track with > 10 GeV/c with charge 
opposite to the primary lepton [17]. The majority of these events originates from genuine 
dilepton events in which one lepton is outside the region covered by the calorimeters or the 
muon detectors. 

Finally, to remove dileptons missed due to inefficiencies of the tracking system, we remove 
events in which a jet with > 15 GeV and \rj\ < 2 has a large electromagnetic fraction 
(E em / E em+had > 0.95) and less than three tracks. These types of events are mostly produced 
by Z — > e + e~ decays. 

The dilepton removal reduces the acceptance for tt events by 17.2%. 



C. The W+ jet sample 

The number of W events surviving the Z and dilepton removal is listed in Table |V11| as 
a function of the jet multiplicity. The transverse mass distribution of the W candidates is 
shown in Figure ||. 

TABLE VII. Number of W candidate events as a function of the observed jet multiplicity. 

Jet multiplicity W — * ev W — » [iv Total 

ljet 5472 3982 9454 

2 jets 744 626 1370 

3 jets 111 87 198 
> 4 jets 26 28 54 
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FIG. 2. Distribution of the transverse mass M of W candidates in the data (•) and in a 
simulation using the Herwig generator (solid histogram). We utilize measured quantities without 
the full set of corrections used to determine the W mass. 



V. ADDITIONAL DATA SAMPLES 

In addition to the Z+ jet sample, we use a number of independent data sets for the 
purpose of calibrating the Monte Carlo generators and the detector simulation. The generic- 
jet samples are described in Subsection A. We will use these sample to derive the new 
parametrization of the mistag rate, to check our evaluation of the efficiency of the tagging 
algorithms, and to calibrate the calculation of the fraction of W+ jet events with heavy 
flavor. Subsection B describes the low-pr inclusive lepton sample which will be used to 
determine the efficiency of the tagging algorithms. Finally, Subsection C details the selection 
of the isolated photon sample. We will use this sample to check the parametrization of the 
mistag rate of our tagging algorithms. 
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A. Generic-jet samples 

The samples JET 20, JET 50, JET 70 and JET 100 are data collected requiring the 
presence of a L2 calorimeter cluster with transverse energy E T > 20, 50, 70 and 100 GeV, 
respectively. 

The samples Et 175 and X) E T 300 are data collected requiring the scalar sum of the 
transverse energy of all calorimeter towers, as evaluated by the L2 trigger, to be larger than 
175 and 300 GeV, respectively 

The last generic-jet sample, J2Et 125 4CL, was collected requiring the presence of four 
L2 calorimeter clusters with Et > 15 GeV and the scalar sum of the transverse energy of 
all calorimeter towers to be larger than 125 GeV. 

The L2 triggers calculate the above quantities with respect to the nominal interaction 
point. Off-line we take as the event vertex the one with the largest ^2,Vt using all tracks 

i 

i associated with the vertex. We retain the events in which the L2 requirements are also 
matched after the event is reconstructed using this vertex. In these events, we inspect all 
jets with E T > 15 GeV and which contain at least two SVX tracks (taggable jets). 



B. The low-pT inclusive lepton sample 

The efficiency of the 6-tagging algorithms needs to be measured in a sample enriched 
in bb production. The low-pr electron sample is collected with the L2 requirement that a 
CFT track with px > 7.5 GeV/c is matched by an electromagnetic L2 cluster with Et > 
8 GeV. The fraction of electrons coming from semileptonic 6-decays is enhanced with the 



selection criteria listed in Table |VIII| . We use electrons in the CEM fiducial region and 
remove photon conversion candidates. We require the lepton to be in a cone of radius 0.4 
around the direction of a taggable jet. We require also the presence of at least one additional 
taggable jet. The 6-purity of this sample is approximately 50%. 

We check the results obtained using the low-pr electron sample using a lower statistics 
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low-pT muon sample collected using the inclusive muon trigger. In this case, a CFT track 
with px > 7.5 GeV/c must be matched to a reconstructed track-segment in both sets of 
central muon detector (CMU+CMP). Central muons which passed the trigger prerequisite 
are selected with the same criteria used for the high-p^ muons listed in Table [V] (we require 
/ > 0.1). 
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TABLE VIII. Criteria used to select the low p t inclusive electron sample. 

Variable Cut 

E T > 10 GeV 

E/P < 1.5 

Ehad/E em < 0.05 

L s hr < 0.2 

|Ax| < 1.5 cm 

\Az\ < 3.0 cm 

Xstrip — ^0 

z- vertex match < 5.0 cm 

I > 0.1 
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C. The isolated photon sample 



The isolated photon sample was collected requiring a L2 isolated electromagnetic cluster 
with Et > 16 GeV and with less than 5 GeV of additional energy in a 5x10 grid of 
calorimeter towers centered on the photon direction [18|]. Photon candidates which pass 
the L3 trigger must be in the good fiducial region of the calorimeter and there must be less 
than 4 GeV in a cone of radius 0.7 around the photon direction. Table |IX| summarizes the 



off-line criteria used for the selection of the photon sample. After requiring the presence of 
an additional jet with Et > 15 GeV and \q\ < 2, the final sample consists of 3000 7+ > 1 
jet events. The expected background contamination of the sample due to tt° and r\ decays 
is estimated to be (45.0 ± 4.5)% @|19 . 



TABLE IX. Criteria used to select the isolated photon sample. 



Variable cut-value 

E T > 23 GeV 

77 < 1.0 
E had /E em < 0.055 + 0.00045 • E T 
Transverse energy deposited in a cone of 

radius 0.7 around the 7 < 2 GeV 

CTC tracks pointing to the 7 cluster None 

Xlrip < 20 
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VI. MONTE CARLO GENERATORS 



In this analysis we use three Monte Carlo generators, Herwig [RJ, Pythia || and Vec- 



bos [20]. The acceptance for tt events is calculated using Pythia version 5.7. The tt accep- 
tance has been also evaluated using the version 5.6 of Pythia and Herwig 5.6. Herwig is used 
to estimate the fraction of W+ > 1 jet events with heavy flavor (using the process 2100). 
We first calibrate the Herwig simulation using generic-jet data, as described in Section 

Both Herwig and Pythia generators use tree-level matrix element calculations for the 
parton hard scattering, convoluted with parametrizations of the parton distribution func- 
tions. The outgoing initial and final state partons are converted into a cascade of gluons 
and qq pairs with energy and angular distributions determined by the Altarelli-Parisi equa- 



tions pl |. The strength of these generators is the modeling of the parton shower which 
accounts for the color correlation between the initial and final state partons. The parton 
shower terminates when the invariant mass of the parton falls below the perturbative QCD 
scale. At this level the partons are turned into colorless hadrons according to phenomeno- 
logical models (the process is called hadronization or fragmentation). For b and c-quarks 
the fragmentation is modeled in Pythia with the Peterson parametrization p2] . We use the 
fragmentation parameter e = 0.006 for 6-quarks and e = 0.05 for c-quarks. Herwig uses 
its own hadronization model, the settings for which are listed in Ref. [|23|j . Both generators 
include a model of the underlying event which describes the hadronization products of the 
beam remnants. 

The Vecbos Monte Carlo is used to study the part of the phase-space in the W+ > 
1 jet production that is not treated correctly by parton shower Monte Carlos, specifically 
Wbb and Wcc events in which the two b or c-partons produce two well separated jets. 
The Vecbos Monte Carlo generator provides a parton level calculation of the W+ n jet 
cross section based on the leading order matrix elements of the hard scattering. Infrared 
and collinear singularities are regulated by requiring that the final-state partons have a 
transverse momentum exceeding a cutoff value p™ m and are separated by more than -R m j n 
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(R = y(A0) 2 + (A??) 2 ). We use p™ m = 8 GeV/c and -R min =0.4. We use the renormalization 
scale Q 2 =< px > 2 , where < > is the average transverse momentum of the outgoing 
partons. We have verified that after our selection cuts the fraction of jets with heavy flavor 
calculated with Herwig matches the Vecbos prediction at the -R m i n threshold. We transform 
the partons produced by Vecbos into hadrons and jets using the Herwig program adapted 
to perform the coherent shower evolution of both initial and final-state partons [^4 . 



In summary, we use Herwig to predict the fraction of W+ > 1 jet events where only one 
jet clustered in a cone of radius 0.4 contain b or c-hadrons while we rely on Vecbos to extend 
the prediction to the cases where two different jets contain both heavy-flavored hadrons. 



We use the MRS D' set of parton distribution functions |g5| to generate W+ jet events 
because it has been shown to reproduce the results of the W asymmetry measured by 



CDF @. 

The decay of hadrons with heavy flavor produced by the Monte Carlo generators is mod- 
eled using the CLEO Monte Carlo generator (QQ) 0. We use the QQ table of branching 
ratios for each decay but our own lifetime table because decay lengths are modeled inside 
the detector simulation. 



VII. DETECTOR SIMULATION 

The QFL detector simulation is used to decay all generated particles and model their 
interactions with the various elements of the CDF detector. The detector response is based 
upon parametrizations and simple models which depend on the particle kinematics. The 
calorimeter simulation is based upon a parametrization of the calorimeter response to single 
particles parametrized as a function of the pseudo-rapidity and azimuthal angle (to account 
for cracks in the calorimetry ) and of the transverse momentum using test-beam data. After 
the simulation of the CDF detector, the Monte Carlo events are treated as if they were real 
data. 
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A. CTC track simulation 



The CTC simulation is not a hit-level simulation. It converts each particle's momentum 
vector at generator level into a reconstructed track using covariance matrices derived from 
the data. Not surprisingly, the track-reconstruction efficiency in the detector simulations is 
higher than that measured in the data. The major factor influencing the track reconstruction 
efficiency is the density of hits in the tracking detector. In this respect, the problem is 
aggravated by the fact that the Monte Carlo generators do not contain multiple interactions. 

To adjust the tracking reconstruction efficiency in the simulation, CTC hits of Monte 
Carlo generated tracks have been embedded in generic-jet data. The efficiency is deter- 
mined by counting the fraction of times the embedded track is reconstructed. The tracking 
efficiency is measured as a function of the hit density around the track for low luminosity 
runs, and then for runs of typical luminosities. Table |X| compares the track reconstruction 
efficiency in the detector simulation to the efficiency for reconstructing simulated tracks em- 
bedded in the data. The degradation of the track reconstruction efficiency is parametrized 
in the detector simulation as a function of the number of hits around the tracks and of the 
average luminosity of the data. This procedure accounts well for the dependence of the 
tracking efficiency on the jet transverse energy. 
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TABLE X. Track reconstruction efficiency for charged particles in the detector simulation (QFL) 
and for Monte Carlo tracks embedded in generic-jet data acquired in low luminosity running. The 
effect of the average luminosity of the data is shown separately. 

CTC track SVX track Luminosity Effect 

Embedded-track 0.94±0.02 0.87±0.03 0.95±0.02 

QFL simulation 0.993 0.983 1 
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B. Lepton identification efficiencies 



Aside from the efficiency for reconstructing a track, the primary lepton identification 
efficiency in the simulation depends also on how well the Monte Carlo generators reproduce 
the isolation distribution and how well the calorimeter response has been parametrized. In 
the simulation, the primary lepton identification efficiencies are measured as the ratio of the 
number of leptons passing the selection cuts listed in Tables | and [TV] to the number of leptons 
generated in the kinematical acceptance. The identification efficiencies in the simulation are 
97 ± 2% for muons and 87.5 ± 2.0% for electrons. The identification efficiencies for primary 
leptons are degraded in the detector simulation to match the ones measured in the data (see 
Table pl|) . Altogether, we degrade the rates of simulated primary leptons by the factor of 
0.936±0.125 (the error includes a 10% uncertainty on the muon trigger simulation). 

The efficiency for identifying soft lepton tags is a far more complicated problem because 
some detector responses, such as dE/dx in the CTC and the CPR chambers, have not been 
parametrized in the detector simulation. The SLT simulation weights tracks correspond- 
ing to leptons from h and c-quark decays at generator level with a parametrization of the 
efficiency of each selection cut measured using the data, as described in Section [VIII C . 



C. SVX track simulation 

The detector simulation becomes unwieldy when simulating tracks that are measured by 
both the CTC and SVX tracking detectors as is the case for input tracks to the SECVTX 
and jet-probability algorithms. The SVX track reconstruction is performed by assigning hits 
on the silicon vertex detector to previously reconstructed CTC tracks. In the data hits are 
assigned if they are contained in a road around the reconstructed CTC track determined by 
its uncertainty (4 a in the r — plane). A CTC track with at least two associated SVX hits is 
defined to be a SVX track and is refitted using the SVX hits and the CTC track parameters 
and covariance matrix. The simulation of the SVX is a hit-level simulation in which the hit 
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resolution is taken from the data. Simulated SVX tracks are reconstructed as in the data. 
However, in the data we must multiply all the elements of the covariance matrix by a factor 



of 2 so that the CTC-SVX matching uncertainty agrees with the measured resolution |27 
while there is no such need in the simulation. 

The efficiency for finding SVX tracks in the detector simulation also needs to be degraded, 
by a factor determined by measuring the efficiency for reconstructing Monte Carlo generated 
tracks embedded at hit-level in generic-jet data (see Table |X|). 

Having done this, the simulation is still not a perfect reflection of the data. For example, 



as shown in Section [VIII B| , the distribution of the impact parameter significance of SVX 
tracks in the data and in the detector simulation are slightly different. We conclude that 
it is neccessary to measure the tagging efficiencies of each algorithm in the data and in the 
simulation and correct the detector simulation for any observed difference. This is done in 



Section IX. 



VIII. DESCRIPTION OF THE TAGGING ALGORITHMS 

The presence of jets originating from b quarks is one of the characteristic signatures 
of ti events. Following previous work |2|||, we tag 6-quarks using two of their distinctive 
properties: the relatively long lifetime and the presence of semileptonic decays. Two tagging 
techniques based on tracking information using the SVX detector have been developed to 
identify jets containing heavy flavor. The Secondary Vertex Tagging algorithm (SECVTX) 
is described in Subsection A. The jet-probability algorithm, used to check SECVTX results, 
is described in Subsection B. The Soft Lepton Tagging algorithm (SLT) is discussed in 
Subsection C, which also includes the evaluation of the SLT fake rate and a description of 
the simulation of this algorithm. 
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A. SECVTX algorithm 



The SECVTX algorithm is described in more detail in Refs. [0,0. SECVTX is based on 
the determination of the primary event vertex and the reconstruction of additional secondary 
vertices using displaced tracks associated with jets. 

The positions of the pp interactions (primary vertices) are distributed along the beam 
direction according to a Gaussian with a width of approximately 28 cm. In the plane 
transverse to the beam axis, these interactions follow a distribution that is a Gaussian with 
a width of 25 /im in both the x and y dimensions. To reconstruct the primary event vertex, 
we first identify its ^-position using the tracks reconstructed in the VTX detector. When 
projected back to the beam axis, these tracks determine the longitudinal position with a 
precision of about 0.2 cm. 

The transverse position of the primary vertex is determined for each event by a weighted 
fit of all SVX tracks which have a z coordinate within 5 cm of the z-vertex position of 
the primary vertex associated with the trigger lepton. First, all tracks are constrained to 
originate from a common vertex. The position of this vertex is constrained by the transverse 
beam envelope described above. Tracks that have impact parameter significance \d\/ad, 
where is the estimate of the uncertainty on the impact parameter d, larger than three 
with respect to this vertex are removed and the fit is repeated. This procedure is iterated 
until all used tracks satisfy the impact parameter requirement. At least five tracks must 
be used in the determination of the transverse position of the primary vertex or we use 
the nominal beam-line position. The primary vertex coordinates transverse to the beam 
direction have uncertainties in the range of 10—25 /im, depending on the number of tracks 
and the event topology. 

The search for a secondary vertex in a jet is a two stage process. In both stages, tracks 
in the jet are selected based on the significance of their impact parameter with respect to 



the primary vertex. The first stage (see Table |XlD requires at least three candidate tracks 
for the reconstruction of the secondary vertex. Tracks consistent with coming from the 
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decay K s — > 7r + 7r~ or A — > 7r~p are not used as candidate tracks. Two candidate tracks 
are constrained to pass through the same space point to form a seed vertex. If at least 
one additional candidate track is consistent with intersecting this seed vertex, then the 
seed vertex is used as the secondary vertex. If the first stage is not successful in finding a 
secondary vertex, a second pass is attempted. More stringent track requirements (on \d\/(Jd 
and pt, for example) are imposed on the candidate tracks. All candidate tracks satisfying 
these stricter criteria are constrained to pass through the same space point to form a seed 
vertex. This vertex has an associated x 2 ■ Candidate tracks that contribute too much to the 
X 2 are removed and a new seed vertex is formed. This procedure is iterated until a seed vertex 
remains that has at least two associated tracks and an acceptable value of x 2 - Table pCT] lists 
the selection criteria used for the determination of the secondary vertex candidates. 

The decay length of the secondary vertex L xy is the projection of the two-dimensional 
vector pointing from the primary vertex to the secondary vertex on the jet axis; if the 
cosine of the angle between these two vectors is positive (negative), then L xy is positive 
(negative). Most of secondary vertices from the decay of b and c-hadrons are expected to 
have positive L xy . Secondary vertices from random combination of mismeasured tracks are 
expected to have a symmetric distribution around L xy =0 [28]. To reduce the background 
from false secondary vertices (mistags), a jet is considered tagged by SECVTX if it contains 
a secondary vertex with -^ £3L > 3.0, where err is the estimated uncertainty on L xy (~ 130 
fim). The mistag contribution to positive SECVTX tags is evaluated starting from the rate 
of negative SECVTX tags and detailed in Section [X|. 
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TABLE XL Selection criteria for CTC and SVX tracks used in the SECVTX 6-tagging algo- 



rithm. A good SVX hit is defined as a hit in the SVX linked to only one CTC 


track. 


Variable 




Cut 


CTC track selection criteria 




No. of axial superlayers 




> 2 


No. of hits in each axial super layer 




> 2 


No. of stereo superlayers 




> 2 


No. of hits in each stereo super layer 




> 2 


X 2 /d.o.{. of the track fit 




< 6 


z-vertex match 




< 5 cm 


SVX track selection criteria - Pass 1 




if A^svx-hits > 3 | 


( ArGood 

I iv SVX-hits 

[ PT 


> 1 
> 0.5 GeV/c 


if A^svx-hits = 2 | 


( ArGood 

I iv SVX-hits 

[ PT 


> 2 
> 1.5 GeV/c 


\d\ 




< 0.1 cm 


\d\/a d 




> 2.5 


SVX track selection criteria - Pass 2 




if A^svx-hits = 4 | 


f /V Good 

1 JV SVX-hits 

[ PT 


> 1 
> 1.0 GeV/c 


if iVsvx-hits = 3 | 


1 ^SVX-hits 
[ PT 


> 2 
> 1.0 GeV/c 


\d\ 




< 0.1 cm 


\d\/a d 




> 3.0 
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B. Jet-probability algorithm 



The jet-probability 5-tagging algorithm || is used to cross-check the SECVTX results. 
The jet-probability algorithm compares track impact parameters to measured resolution 
functions in order to calculate for each jet a probability that there are no long lived particles 
in the jet cone. This probability is uniformly distributed between and 1 for light quark 
or gluon jets, but is very small for jets containing displaced vertices from heavy flavor 
decays. We briefly describe the transformation from the track impact parameters to the 
jet-probability measure. 

The track impact parameter significance S is defined as the value of the impact param- 
eter d divided by its uncertainty ad- Tracks used in the calculation of jet-probability are 



required to satisfy the quality criteria listed in Table [XI 1| . The sign of the impact parameter 
significance is defined to be positive if the point of closest approach to the primary vertex 
lies in the same hemisphere as the jet direction, and negative otherwise. Figure ^| shows 
the distribution of the impact parameter significance of tracks in the JET 50 sample. This 
distribution is fitted with the resolution function 7Z(S). 
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FIG. 3. Distribution of the signed impact parameter significance of tracks in the JET 50 sample. 
The resolution function TZ(S) is the result of a fit using two Gaussians plus an exponential function, 
separately for the positive and negative sides. 

The negative side of the resolution function TZ(S) derived using JET 50 data is used to 
determine the probability P(Sq) that the impact parameter significance Sq of a given track 
is due to the detector resolution: 



Figure ^ shows that the impact parameter significance distribution of tracks in the JET 50 
data and in the corresponding simulation are slightly different. The resolution functions 
7Z{S) are therefore defined separately for the data and the simulation in order to account for 
the differences in the resolution between the true and the simulated detector performance. 



P(So) 



rf 01 n(S)ds 
j _ oo n(S)ds 
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FIG. 4. Distribution of the impact parameter significance djod of tracks in the JET 50 data 
(histogram) and the corresponding Herwig simulation (shaded histogram) . The tracks are required 



to satisfy the criteria listed in Table XII 



The probability that a jet is consistent with a zero lifetime hypothesis is calculated as: 



jpB = nE 

k=0 



k\ 



where n is the product of the individual probabilities of the N SVX tracks in a jet which 



satisfy the criteria listed in Table |XII| . Jet-probability is defined using tracks with positive 
impact parameter and requiring N > 2. We also define a negative jet-probability in which 
we select only tracks with negative impact parameter in the calculation. This is used as a 
control sample and a check of our method. 
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TABLE XII. Selection criteria for tracks used by the jet-probability algorithm. 

Variable Cut 

SVX track selection criteria 
\d\ < 0.15 cm 

p T > 1.5 GeV/c 

-^SVX-hits > 2 
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Figure |5] shows the positive and negative jet-probability distributions in a sample of 
JET 50 and JET 140 data. The positive jet-probability distribution shows jets containing 
hadrons with heavy flavor as a large excess at jet-probabilities smaller than 0.05 over a flat 
distribution. A jet has a positive JPB tag if the jet-probability value is smaller than 0.05. 
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FIG. 5. Distributions of positive and negative jet-probability in a mixture of JET 50 and JET 
140 data. The lines represent a fit to the negative distribution with a first order polynomial. The 
slope of the fit corresponds to a 1.6% change of the distribution over the entire jet-probability 



range. 



The negative jet-probability distribution is quite flat, as expected, since the resolution 
files were constructed using tracks with negative impact parameter. The small excess at 
negative jet-probability smaller then 0.05 (negative JPB tags) is due to the increase of the 
fraction of jets from heavy flavor in the JET 140 with respect to the JET 50 data. This 
excess largely disappears, as shown in Figure || when plotting the negative probability of 
jets which have a large positive jet-probability (0.1 — 1). Since tracks with negative signed 
impact parameter in JET 50 data are used to define the resolution function, the small 
contribution to negative tags from jets with heavy flavor is uncorrectly attributed to the 
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detector resolution by this procedure. It will be accounted for in the evaluation of the JPB 
mistags in Section [X]. 
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FIG. 6. Negative jet-probability distribution for jets with positive jet-probability greater than 
0.1. This selection requirement removes most of the jets with heavy flavor. The line corresponds 
to the fit to the negative JPB distribution shown in Figure |5|. 

Ideally JPB tags corresponding to jet-probability values smaller than 0.05 should contain 
a 5% mistag rate. This expectation is tested in Figure |6] fitting a first order polynomial 
function to the jet-probability distribution in the interval 0.1 — 1.0. The extrapolation of the 
fitted function predicts 4441±34 negative JPB tags while 4455 are observed; this corresponds 
to 4.94% of the total number (101050) of jets in the sample. 
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C. SLT algorithm 



The SLT algorithm tags b quarks by searching for an electron or muon from their de- 
cay. Low momentum or soft leptons can also result from 6-hadron decay through sequential 
c-decays, or r and J / ip cascade decays. This analysis follows the guidelines for the identifica- 
tion of soft electrons or soft muons documented in Refs. [^,^]. While previous measurements 
of the ti cross section used rates of events with SLT tags, in this analysis we search for soft 
lepton candidates only in a cone of radius 0.4 around the axis of a jet with E? > 15 GeV 
and |?7| < 2. 

To search for soft electrons, every CTC track with px > 2 GeV/c, which is associated 
to a jet, is extrapolated into the fiducial region of the calorimeter and is matched to a CES 
cluster. The matched CES cluster is required to be consistent in shape and position to the 
expectations for an electron. In addition, we require 0.7 < E/P < 1.5 and E had /E em < 0.1. 
The energy deposited by the track in the preradiator (CPR) is required to be consistent 
with an electron shower. The track ionization rate (dE/dx), derived from the charge de- 
position of the CTC hits associated with the track, is also required to be consistent with 
the electron hypothesis. Electrons from photon conversions are removed. Photon conver- 
sions are identified as combinations of the electron candidate and an additional track with 
opposite charge passing the criteria listed in Table |TV] with the additional requirement that 
the invariant mass be smaller than 500 MeV/c 2 . The selection criteria used to define the 



soft electron are described in more detail in Ref. [p9|l . The efficiency of each criteria used to 
select soft electron candidates has been measured using a sample of electrons produced by 
photon conversions f| (the efficiency of the E/P and E had /E em cuts is calculated using the 
simulation) . 

Soft muons are identified by matching CTC tracks with > 2 GeV/c to track segments 
in the CMU, CMP and CMX muon chambers. Muons candidate tracks with pt > 3 GeV/c 
that are extrapolated to the fiducial volume of both the CMU and CMP system are required 
to be matched to track segments in both muon detectors. To maintain high efficiency for 
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non-isolated muons, we do not impose minimum-ionization requirements on the calorimeter 
deposition. However, in order to reduce hadronic punch-through in the region not covered by 
the CMP system, we check that the energy, E^ad, in the tower traversed by muon candidates 
with pt > 6 GeV/c is consistent with the muon hypothesis; we require Ehad < G + J2p, where 
J2P is the scalar sum of the momenta of all tracks contained in a cone of radius 0.2 around 
the muon direction. The efficiency of each selection cut has been measured using a sample 
of J/ip — > n + n~ and Z — > /i + decays [^jp9|. 

Figures [7| and || show distributions of the invariant mass between primary and soft leptons 
in all W+ >1 jet events. As shown in Figure there is a handful of events where the soft 
muon is consistent with being the second leg of a Z boson decay embedded in a jet. Soft 
muons which, when combined with the primary muons of opposite charge, yield an invariant 
mass 70 < M MAt < 110 GeV/c 2 are not considered tags. 
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FIG. 7. Invariant mass distributions between the primary electron and the soft lepton can- 
didates in W+ >1 events. OS and SS refer to lepton pairs with opposite and same charge, 
respectively. 
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FIG. 8. Invariant mass distributions between the primary muon and the soft lepton candidates 
in W+ >1 events. OS and SS refer to lepton pairs with opposite and same charge, respectively. 
The shaded area indicates soft muons not considered tags. 



1. Fake soft lepton tags 

This background includes hadrons which pass the lepton selection cuts (such as pions 
which fake an electron or a muon) as well as electrons from conversions or muons from pions 
or kaons which decay in the detector. Because of the complication of accurately modeling 
these processes in the QFL detector simulation, this background is estimated using the data. 
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The SLT fake rate is measured starting from the ratio of the number of tracks passing the 
soft lepton selection criteria to the total number of tracks which satisfy the soft lepton fiducial 
requirements in generic-jet data flj29fl. In the JET 20, JET 50 and JET 70 samples the 
probability P that a track produces a SLT tag is computed separately for electrons and for 
different types of muon detectors (CMU,CMP and CMX). This probability is parametrized 
as a function of the track pt and isolation Since in this analysis we search a jet for 



SLT candidates in a cone of radius of 0.4 around its axis, we define a SLT probability per 
jet P J s f T (N) = Eili PstAi - 1) + (1 - HfAi ~ 1)) x Pi where N is the number of tracks 
contained in a cone of radius 0.4 around the jet axis. 

In Table pCIIIj the observed rates of SLT tags in various generic-jet samples are compared 
to the rates predicted by the probability Pslt described above. Since in generic-jet data 
the trigger jet is biased toward a lower yield of soft muons (a jet containing a muon has a 
lower energy deposition in the calorimeter and therefore is less likely to be the trigger jet) 
the comparison is performed with and without the trigger jet. However, when more than 
one jet is above the trigger threshold, all jets are considered. Excluding trigger jets from the 
comparison one observes agreement between the observed and predicted rates of tagged jets. 
The last seven samples shown in Table |X111| were not used to determine the SLT probability 



per track. Predicted and observed yields of SLT tags in all samples agree within 15%. As 
the amount and type of heavy flavor changes appreciably in different QCD samples (see 
Section [X]) the apparent agreement suggests that the rate of SLT tags in generic-jet data is 
dominated by fakes. 

The SLT fake probability is obtained by removing the contribution of SLT tags due to 
heavy flavor decays in the generic-jet data used to construct the SLT probability per track. 
For this purpose, we use the signed impact parameter significance distribution of the soft 
lepton tracks. The distribution observed in the data is fitted with the shape expected for 
leptons coming from the decay of b and c-hadrons, derived in simulated events, in addition to 
the shape of fake SLT tags. The shape for fake SLT tags is derived using all tracks taggable 
by the SLT algorithm in events which do not contain any SECVTX, JPB or SLT tags. 
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FIG. 9. Distribution of the signed impact parameter significance of SLT tracks contained in 
the JET 50 data (•). The solid histogram represents a fit using the shapes expected for b and c 
semileptonic decays and for fake tags. 



Figure |9] shows the signed impact significance distribution of SLT tags in JET 50 data 
along with the fit result. The composition of the SLT tags determined from these fits is 
(74.0 ± 3.2)% fakes, (10.5 ± 2.3)% 6's and (14.5 ± 4.3)% c's for all three generic-jet samples 
used to evaluate the SLT tagging probability. Due to the systematic associated with using 
simulated shapes for the impact parameter distribution of lepton coming from b and c- 
decays, we estimate a 10% error on the fraction of fake SLT tags determined by the fits. 
Based on this result, the SLT mistag probability per jet is obtained by rescaling the SLT 
tagging probability in generic-jet data by (74.0 ± 7.4)%. 
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TABLE XIII. Comparison of the observed and predicted yields of jets with SLT tags. 



Samples used in the fake parametrization 



Sample 


Predicted (P) 


Observed (O) 


(P-0)/0 


JET 20 


5353.9 


4994 


7.2% 


JET 20 without leading jet 


3392.4 


3383 


0.3% 


JET 50 


7082.9 


6408 


10.5% 


JET 50 without leading jet 


4947.4 


4988 


-0.8% 


JET 70 


8089.2 


7277 


11.2% 


JET 70 without leading jet 


5724.9 


5678 


0.8% 


Independent samples 


JET 100 


8603.6 


7483 


15.0% 


JET 100 without leading jet 


6109.8 


5909 


3.4% 


JET 140 


1324.1 


1196 


10.7% 


£ E T 175 


3392.6 


3392 


0.02% 


£ E T 125 4CL 


9651.9 


10095 


-4.4% 


£ E T 300 


1627.1 


1401 


16.1% 


Isolated 7 


365.8 


352 


3.9% 
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2. Simulation of the SLT algorithm 



A main feature of the soft lepton tagger is that it was developed from studying real leptons 
from photon conversions and J /if) mesons. The efficiency of each selection criteria is well 
measured in the data. Therefore, the simulation of the soft lepton tagger does not need to 
rely on the QFL modeling of the detector response in order to estimate the tagging efficiency. 
The SLT simulation matches tracks produced by QFL to electrons and muons at generator 
level. The electrons or muons are required to come from b or c decay or any of their cascade 
decays. Electron tracks are extrapolated to the CPR and CES detectors, and required to 
pass fiducial cuts. Electron candidates are eliminated if they are consistent with arising 
from photon conversions. Muon tracks are extrapolated to the muon detectors, required to 
pass the fiducial cuts and classified according to the muon detector type. Finally tracks are 
weighted with the measured efficiencies of the selection criteria, which are functions of the 
track transverse momentum [p[p9[. This procedure ensures that the simulation accurately 
models the soft lepton tagging efficiency. 

In Section |XI] we compare rates of SLT tags in generic-jet data to the corresponding 
simulation to verify that the procedure has been implemented correctly. By construction, 
the SLT simulation does not produce mistags. However, when applying the SLT mistag 
probability to tracks in simulated generic-jet data, we predict the same rate of fake SLT 
tags per jet that is observed in the data. This implies that tracks in simulated jets have the 
same multiplicity and transverse momentum distributions as in the data. 



IX. EFFICIENCY OF THE SECVTX AND JPB TAGGERS 



We first describe the calibration of the efficiency of the tagging algorithms in the simula- 



tion. For this purpose, we use the low-pr inclusive electron sample described in Section |VB 
and the corresponding simulation. A large fraction of the events in this sample is expected 
to originate from bb production in which a jet containing an electron from a semileptonic 
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6-decay, called e-jet, recoils against a jet from the other b, called the away-jet or a-jet. The 
tagging efficiency in the simulation, ejf c , is adjusted to the value of the tagging efficiency 
in the data using the scale factor 

op £b 

E b 

Following the derivation of the scale factor, Subsections A-I discuss the various sources 
of systematic uncertainty and also present cross-checks. In Subsection J we provide an 
explanation for the deviation of the scale factor from unity. 

The data sample consists of 55248 events. The simulated sample is generated with 
Herwig p3| (process 1500, 2^2 hard scattering with transverse momentum threshold 
p™ m > 13 GeV/c). Using the generic hard parton scattering, bb and cc pairs are produced 
through processes of order a 2 s as gg — ► bb (direct production). Processes of order are 
implemented in the generator through flavor excitation processes such as gb — > gb or gluon 
splitting, where the process gg — > gg is followed by g — > bb. We use the MRS(G) set 
of parton distribution functions PD[ . Apart from the parton distribution functions, the 
simulation package is the same as that used to generate W+ jet events. The generated hard 
scattering sample corresponds to an integrated luminosity of 83.5 pb _1 . In this sample we 
select events with an e-jet containing hadrons with heavy flavor. After applying the same 
selection used for the data, the simulated sample low-pr electron sample contains 16547 
events. 



Table [XIV| shows the composition of the simulated inclusive electron sample. One notices 
that 80% of the e-jet are due to bb production and that only 33% of the away-jets contain 
heavy flavor. 

In the simulated events where the away jet is tagged by SECVTX (ST a _j et ), 94% of the 
electron-jets are due to bb production. It is therefore convenient to measure the 6-tagging 
efficiency as the fraction of these events in which the electron-jet is tagged by SECVTX or 
JPB 

er = (!) 
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where SDT is the number of events where both the electron and away-jet contain heavy 
flavor and are tagged. The bb production accounts for 99% of the simulated events with a 
double tag. 

Table [XV| lists rates of tags in the data and in the simulation. In the simulation there 
are very few mistags and they are easily identified because the jet does not contain b or c- 
hadrons in a cone of radius 0.4 around its axis. In the data, the rate of mistags is evaluated 
using the parametrization described in Section |X|. 

We use the simulation to describe Fhf, the fraction of data in which electron-jets contain 
hadrons with heavy flavor. The data contain also a relevant number of e-jets in which the 
electron is not associated with the production of hadrons with heavy flavor (mostly from 
photon conversions in jets due to light quarks or gluons). In these events, the electron-jet 
contributes only mistags. To describe the remaining fraction (l — F^f) of the data, we make 
the additional assumption that away-jets in these events contain the same fraction of heavy 
flavor as generic-jets. The parametrization of the probability of tagging jets with heavy 
flavor in generic-jet data is derived in Section fX|. The 10% uncertainty associated with this 
parametrization is discussed in Section [IX A| . 
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TABLE XIV. Fractions of electron and away-jets before and after tagging in the low-px inclusive 
electron simulation. SN^- t and SN?™™? are the fractions of away-jets with and without heavy 



flavor. 





direct production 


flavor excitation 


gluon splitting 




b (%) 


c (%) 


b (%) 


c (%) 


b (%) 


c (%) 


SN e _ jet 


20.90 


3.49 


39.72 


10.26 


19.39 


6.22 


SNt f jet 

^prompt 

qrnSEC 

D1 e-jet 

qrpJPB 
J1 e~jet 

qrnSEC 
D1 a-jet 

qrnJPB 
J1 a~jet 

SDT SEC 


19.93 
1.64 
24.51 
23.57 
70.50 
67.59 
73.46 


3.31 
0.29 
0.68 
1.60 
3.07 
5.23 
0.54 


5.91 
35.65 
47.58 
44.64 
16.17 
15.06 
17.01 


1.35 
9.38 
2.55 
5.93 
2.29 
3.51 
0.43 


2.61 
19.60 
22.74 
20.75 
7.47 
7.11 
8.45 


0.53 
6.38 
1.93 
3.51 
0.51 
1.50 
0.11 



TABLE XV. Number of events before and after tagging electron and away-jets. Pqcd is the 
probability of tagging away-jets if they contain the same heavy flavor fraction as generic-jets (see 



text). 





Data 






Simulation 


Type 


Observed— mistags 


Pqcd(%) 


Type 


Observed-mistags 


N e -j e t 


55248 




SN e _j et 


16547 


T SEC 


8158 - 84.3 




qrnSEC 

01 e~jet 


4549- 


rpJPB 

± e-jet 


9123 - 335.3 




qrnJPB 
D1 e-jet 


5990- 


rpSEC 

a— jet 


3640 - 112.8 


1.67 


qrnSEC 

D1 a-jet 


1832- 7 


JjrpSEC 


1126 -23.8 




SDT SEC 


545- 1 


JjrpJPB 


1225 - 35.3 




SDT JPB 


743- 1 
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We use the following procedure to derive the tagging efficiency scale factor separately for 
SECVTX and jet-probability together with the heavy flavor purity F h f of the data. The 
data and the simulation are normalized to the same number of tagged electron-jets that 
contain heavy flavor, T e -j et and ST e ^j et , through the coefficient 

T e —j e t 



Before tagging, the heavy flavor purity of the data is therefore given by 

a ■ SN e _ jet 

Fhf ~ SF ■ N— t (2) 
where N e ^j et and SN e _j et are the number of e-jets in the data and the simulation and SF 
is the tagging efficiency scale factor. We start assuming SF = 1. It follows that in the 
data the number of events in which a tagged away-jet with heavy flavor is associated to an 
electron-jet without heavy flavor is 

T®-jet = (1 - Fhf) ■ N e _ jet ■ Pqcd 

and that the number of events in which a tagged away-jet containing heavy flavor is associ- 
ated with an electron-jet also containing heavy flavor is 

rnSEC _ rpQCD 

a— jet -La— jet' 

For the data the 6-tagging efficiency, analogy of equation (1), is then 

DT 

£b 



rpSEC _ rpQCD ■ 
a— jet a— jet 



where, as before, DT is the number of events in which the a-jet is tagged by SECVTX and 
the e-jet has a SECVTX or JPB tag. 

The ratio of the tagging efficiencies in the data and simulation yields the scale factor 



SF 



£b 



The value of the scale factor is inserted again in the equation (2) and we iterate until the 
scale factor value is stable to within 1%. 
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TABLE XVI. Data to Monte Carlo tagging 


; efficiency scale factors. F^f 


is the fraction of e-jets 


containing heavy flavor in the data. 






Sample 


SF 




SECVTX e-jet, SECVTX a-jet 


1.23±0.07 


43.5±2.9 % 


JPB e-jet, SECVTX a-jet 


0.96±0.05 


45.3±2.4 % 



55 



Using the numbers of electron and away-jets listed in Table |XV| , we derive 5F=1.23±0.07 
for SECVTX and 0.96±0.05 for jet-probability. The error accounts for the sample statistics 
(with the largest contribution coming from the simulation) and for 10% uncertainties in the 
evaluation of the mistag rates and in the prediction of the rate of tags in generic-jets with 
heavy flavor. 

The 6-purity of the e-jets before tagging, Fhf = (43.5 ± 2.9)%, is in agreement with the 
measurement in Ref. ||, 37±8%, using the fraction of tagged electron-jets that also contain 
a muon of opposite charge. 

The average SECVTX tagging efficiency is 36.7±1.9% in the data and (29.8±1.1)% 
in the simulation. The corresponding numbers for jet-probability are (39.2±2.1)% and 
(40.7±1.1)%, respectively. 

Since the tagging efficiencies depend on the jet energy, it is important to verify the 
agreement between the tagging efficiencies as a function of the transverse energy in the data 



and the simulation as is done in Figures [K] and [11]. The distributions of the lifetime and 
invariant mass of the SECVTX tags is given in Figure O and supports our determination 
of the 6-purity of the sample. The lifetime of a SECVTX tag is defined as 

L xy ■ M SVX 



pseudo-r- 

c • p T 

where M svx and p^ vx are the invariant mass and the transverse momentum of all tracks 
forming the SECVTX tag. 
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FIG. 10. Distributions of the transverse energy of electron-jets (a) and away-jets (b) tagged by 

SECVTX. 
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FIG. 11. Distributions of the transverse energy of electron-jets (a) and away-jets (b) in events 

with double SECVTX tags. 
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FIG. 12. Distributions of pseudo-r (a) and of the invariant mass (b) of SECVTX tags in elec- 
tron-jets; (c) and (d) are the analogous distributions for away-jets in events with double tags. 



A. Check of the background parametrization using a photon conversion sample 

In events where the e-jet does not contain heavy flavor, we predict the rate of tagged 
away-jets containing heavy flavor using the probability of tagging jets with heavy flavor as 
measured in generic-jet data. We test this method in a sample of data where the electrons 
in the e-jet are due to photon conversions. The criteria used to identify photon conversions 
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are listed in Table |I|. In this case we require that an electron is matched by a second track 
consistent with a photon conversion and that it is not matched by a track segment in the 
VTX detector. Otherwise, we select this sample as the inclusive electron sample where in 
contrast conversions were removed. 

Following the procedure used in the previous section, we determine the fraction of events 
with heavy flavor to be Fhf = 8.7±0.9% from the number of e-jets with a SECVTX or JPB 
tag. Tagging rates in events due to heavy flavor production are described using the Herwig 
simulation as used in the previous section. In the remaining 91.3% of the events, we describe 
the rates of tagged away-jets using the parametrization derived from generic jets. 



Table |XVI1| shows that this procedure correctly predicts the rates of tags observed in the 
data. We take the 10% statistical error of this comparison as the systematic uncertainty of 
the method. 
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TABLE XVII. Rates of events in which the electron jet is due to a photon conversion before 
and after tagging. The heavy flavor purity of this sample is F^j=8.7±0.9%. Events where the e-jet 
contains heavy flavor are described with the Herwig simulation. In the remaining events, the rate 
of tagged away-jets (QCD) is predicted using the probability for tagging jets with heavy flavor in 
generic-jet data. Mistags have been removed from the data and simulation. 



Type 

N e -jet 

rpSEC 

A e-jet 

rpJPB 

e—jet 

rpSEC 

a— jet 

rpJPB 

a— jet 



Data 
4027 
108.3±10.6 
133.1±12.5 
102.2±10.5 
135±13.7 



Simulation 
350±37 
114±12 
126±13 
41.6+5.0 
45.0±4.5 



QCD 
3677±37 



60.2±6.0 
86.7±8.7 



Prediction 
4027 
114±12 
126±13 
101.8 ± 7.8 
131.7 ± 9.8 
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B. Sensitivity of the scale factor to the modeling of c-jets 



In the simulation the tagging efficiency is defined as the ratio of events with double tags 
to all events where the away-jet is tagged by SECVTX. As shown in Table |X1V| , the Herwig 
simulation predicts that 94% of the a-jets with a SECVTX tag are due to bb production. The 
remaining 6% of the a-jets are due to cc production and are accounted for by the simulation 
but in principle this could be improperly modeled. In events where a-jets have a JPB tag, 
the fraction of cc production increases to 11% (see Table |X1V| ). If SECVTX and JPB scale 
factors are determined using a-jets tagged by JPB instead of a-jets tagged by SECVTX, 
the scale factors change each by less than 1%. Therefore, we conclude that the modeling of 
c-jets is satisfactory for the determination of the 6-tagging efficiency scale factor. 

C. Dependence of the scale factor on the gluon splitting cross section 

As shown in Table |XIV| , a fraction of the events in the inclusive electron sample is due 
to gluon splitting to heavy flavor quarks. The calibration of the Herwig simulation using 
generic-jet data in Section |X| shows that direct production and the heavy flavor excitation 
as implemented in Herwig provide a fair description of the data, but the gluon splitting cross 
section requires a (40 ± 20)% correction. We repeat the calculation of the scale factor using 
this larger gluon splitting cross section. We find that the SECVTX scale factor increases 
from 1.23 to 1.25. The final scale factor we use will be this latter value. 

D. Et dependence of the scale factor 

Jets produced directly in association with a W boson have transverse energies comparable 
to the jets in the low-pr inclusive electron sample. However, fe-jets produced by top decay 
have substantially higher transverse energies. In this section, we investigate a possible Et 
dependence of the scale factor using two methods. 
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First, we derive the value of the SECVTX scale factor in four different bins of the 
electron-jet transverse energy. In each bin, we calculate the average e-jet transverse energy 
< Et > and the scale factor using the iterative procedure previously described. The result 
of the study is shown in Figure [13|. A fit of the scale factor as a function of the transverse 
energy with a first order polynomial yields a x 2 of 0.3 for 2 d.o.f. and 

SF(E T ) = (1.23 ± 0.17) - (0.1 ± 4.0) x 10~ 3 x < E T > (GeV) 

with a correlation p = —0.95 between the two fit parameters. The result of this fit is 
therefore consistent with a constant scale factor. 
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FIG. 13. SECVTX tagging efficiency scale factor as a function of the average transverse energy 
< Et > of the electron-jet. The line represents a fit with a first degree polynomial. 



In the second method, we compare the fraction of jets with heavy flavor tagged by 
SECVTX in JET 50 and in JET 100 data and in the corresponding Herwig simulation 
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tuned as in Section [XI[ The 6-tagging efficiency in the detector simulation is increased by 
the factor 1.25 independently of the jet transverse energy. The ratio RSF of the fractions 
of tagged jets in the data and in the simulation is sensitive to any residual dependence 



of the scale factor. The result of this method is shown in Figure We fit the ratio RSF 
of the tagging efficiencies in the data to the simulation as a function of the jet transverse 
energy with a first order polynomial. The fit yields a \ 2 °f 51 for 49 d.o.f and 

RSF(E T ) = (1.01 ± 0.05) + (1.3 ± 4.6) x 10~ 4 x E T (GeV) 

with a correlation p = —0.92 between the two fit parameters. The data are consistent with 
a constant scale factor. 
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FIG. 14. Fractions of tagged jets (a) as a function of the jet transverse energy. The residual 

scale factor (b) is defined as the ratio of these fractions in the data and the simulation. The open 

circle in (b) represents the inclusive electron sample result. 



E. Uncertainty of the scale factor 



The SECVTX 6-tagging efficiency scale factor measurement using the inclusive electron 
sample has a 5.6% uncertainty. The uncertainty in the gluon splitting cross section predicted 
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by Herwig results in an additional 0.8% uncertainty of the scale factor. By folding the Et 
spectrum of 6-jets from top decays with the Et parametrization of the scale factor from the 
fit shown in Figure [13] (a variation of the fit parameters by ±ler yields a ±4.2% change in the 
efficiency for tagging 6-jets and ±3.9% change in the efficiency to tag top events), we estimate 
a 4% uncertainty from any residual Et dependence. These errors are mostly systematic and 
in general highly correlated. Altogether, we assign a 10% error to the determination of 
the scale factor after combining linearly the above contributions. Our final estimate of the 
6-tagging efficiency scale factor for the SECVTX algorithm is SF = 1.25 ± 0.13 and for the 
jet-probability algorithm is SF = 0.96 ± 0.10. The latter is consistent with unity. 



F. Check of the scale factor using a low-pr inclusive muon sample 

The \ow~pt inclusive muon sample is analogous to the electron sample in that a muon 
with pt > 10 GeV/c is required in place of an electron with Et > 10 GeV (see Section [V lj| ) . 
It provides an independent sample for checking the tagging efficiency scale factor. The low-p-r 
muon sample consists of 10393 events. In these events muon-jets without heavy flavor are due 
to fake muons arising from non-interacting hadrons or in-flight decays of K and 7r mesons. 
We compare to a simulated sample also generated using the option 1500 of Herwig which 
consists of 4280 events. The same procedure described above yields a SECVTX tagging 
efficiency scale factor of 1.24±0.10, in agreement with the value 1.23 ±0.07 derived in the 
inclusive electron sample (before correcting the gluon splitting cross section). At the same 
time the heavy flavor purity of the low-pr muon sample is measured to be F) t /=59.7±3.6%. 

G. Check of the scale factor in jets containing inclusive 6-decays 

In this section we investigate whether the scale factor is different in jets containing 
semileptonic 6-decays and inclusive 6-decays. We use the low-pr inclusive electron samples 
and normalize the data and the simulation to the same number of electron-jets with a 
SECVTX tag after mistag removal. In the simulation, the rate of gluon splitting to bb and 
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cc pairs is corrected as in Section [Xj. We compare rates of away-jets which are taggable and 
which are tagged by SECVTX. We find that the simulation predicts correctly the amount of 
taggable away-jets but it underestimates by a factor 1.23 ± 0.08 the rate of SECVTX tags 
with respect to the data. 

H. Check of the scale factor using rates of double tags in generic-jet data 



The studies of the Et dependence of the SECVTX scale factor performed in Section |IX D 
depends upon the assumption that Herwig models correctly the fractional yield of jets with 
heavy flavor as a function of their transverse energy. We use the JET 50 and JET 100 data 
and simulation for a test independent of this assumption. We select events with only two jets: 
one taggable jet with transverse energy larger than the trigger threshold and one taggable 
jet with Et > 15 GeV in the opposite hemisphere. We compare the number of events with 
double JPB tags and double SECVTX tags in the data and in the Herwig simulation after 
mistag removal. In the simulation, 92% of these double tags are due to bb production. The 
ratio of double SECVTX to double JPB tags in the data and the simulation is: 

#data = 0.92 ± 0.18 and R sim = 0.61 ± 0.05 

This ratio does not depend on the absolute cross section for producing jets with heavy flavor. 
From the equivalence 

Rdata _ ( SF SEC \ 

we measure gpjp ^ =1.24^=0.13 using generic jets with high transverse energy, in agreement 
with the value g^jpa =1-28±0.10 measured in the low-pj- inclusive electron sample. 

I. SECVTX efficiency for tagging c-jets 



Since we need to apply a large correction to the simulated SECVTX efficiency for tagging 
6-jets, it is worth investigating differences between data and simulation for tagging c-jets. For 
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this purpose, we compare rates of tags in the JET 50 and JET 100 data to the corresponding 
Herwig simulation, described in Section |XI], normalized to the same number of events. 

We define R as the ratio of the number of SECVTX to JPB tags after mistag removal. 
In the data R = 0.77 ± 0.07. Under the assumption that the heavy flavor composition of 
the data is modeled correctly by Herwig, the SECVTX scale factor for c-jets, SF^ EC , can 
then be derived solving the equivalence 

rpSEC . SpSEC _|_ j^SEC . gpSEC 
~ fJPB . 5 J? JPB 

where T^ EC =5354 and — 2477 are the number of simulated b and c-jets tagged by 
SECVTX, and T JPB =11958 is the number of JPB tags. Using SF b SEC =1.23 ± 0.07 and 
SF JPB =0.96 ± 0.05, we derive that the SECVTX scale factor for tagging c-jets is SF C = 
0.92 ± 0.28. The error is determined by the uncertainty of the heavy flavor composition 
(see Section |X| ) and by the errors of the scale factors SF^ and SF JPB . 



J. Understanding of the scale factor 

In an effort to explain the 25% difference of the SECVTX tagging efficiency in the data 
and the simulation we uncovered three oversights in the simulation package used in this and 
in some previous CDF analyses . A significant fraction of the difference is due to the use 
of an outdated version of the CLEO decay tables and to outdated 5-lifetimes in the CDF 
particle database. The above two inaccuracies account for ~ 40% of the difference of the 
SECVTX scale factor from unity. Small inconsistencies in the implementation of the SVX 
geometry in the simulation contribute an additional 16% to this difference. If we corrected 
for these effects, the new determination of the SECVTX scale factor would be 1.09±0.11; 
the uncertainty includes the error on the 6-lifetime (~3%) and the uncertainty of the track 
degradation procedure described in Section |VU| (~8%). The efficiency of jet-probability is 
not affected by these changes in the QFL simulation. 
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X. SECVTX AND JPB MISTAGS 



In this section we estimate the SECVTX and JPB mistag rate in a variety of control 
samples before applying it to W+ jet and Z+ jet events in Sections XII and XIII. Tags in 
jets without heavy flavor, which we call mistags, are caused by detector resolution effects. 
SECVTX mistags are poorly reproduced by our detector simulation and traditionally CDF 
removed this background from the data using a parametrization of the probability of finding 
negative SECVTX tags in JET 50 data We derive a new parametrization of the 

mistag rate using the JET 20, JET 50, JET 70, JET 100 and ££ T 300 data described 



in Section |V A| . Even if JPB mistags are well reproduced by the detector simulation, we 
derive a mistag parametrization also for JPB tags because jet-probability has a higher rate 
of mistags than SECVTX and provides a better check of the method. 

The method to evaluate the mistag probability starts with the measurement of the num- 
ber of positive and negative tags in generic-jet data and their parametrization as a function 
of the jet Et and the jet track multiplicity, N^rk- The tagging probability is derived as a 
ratio of the number of tags to the number of taggable jets in bins of transverse energy and 
track multiplicity. We use only jets which are far away from calorimeter cracks and correct 
the jet energy for the detector response and out-of-cone losses (see Section [111 D| ) . 

Negative tags are also produced in jets containing heavy flavor. In particular, the prob- 
ability of producing negative tags is different for jets initiated by a heavy-quark or by gluon 
splitting to a pair of heavy quarks. Since this contribution to negative tags must be ac- 
counted for and subtracted in order to obtain the mistag rate due to jets without heavy 
flavor, it is important to parametrize the rate of negative tags in a sample in which the 
composition of quark and gluon jets is well understood and is not subject to additional 
uncertainty due to the simulation. For this reason, in each generic-jet sample, we use only 
jets with transverse energy above the trigger threshold (leading jets): jets with corrected 
Et > 30, 70, 90, 120 and 160 GeV in the JET 20, JET 50, JET 70, JET 100 and ££ T 300 
data, respectively. In the generic-jet simulation, 95% of the leading jets with a tag contain 
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just one heavy-flavored hadron (a large fraction of these leading jets is produced by heavy 
quarks from flavor excitation or direct production). The region below 30 GeV is mapped 
selecting events containing two leading jets, but using only the additional jets in the event ; 
in the simulation, 96% of these non-leading jets which are tagged contain two hadrons with 
heavy flavor produced by a gluon splitting process. 

Transverse energy distributions of the jets used to measure the tagging probability are 



shown in Figures [15| and |16| Projections of the tagging probability matrices are shown in 



Figures |17] and [18 
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FIG. 15. Transverse energy distributions of (a) taggable jets, and jets with positive (b) and (c) 

negative SECVTX tags. 
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FIG. 16. Transverse energy distributions of jets with (a) at least two JPB tracks with positive 
impact parameter significance, (b) with positive JPB tags, (c) with two or more JPB tracks with 
negative impact parameter significance, and (d) with negative JPB tags. 
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FIG. 17. The positive and negative SECVTX tagging probability as a function of (a) the jet Er 
and (b) the number of SVX tracks in a jet. 
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FIG. 18. The positive and negative JPB tagging probability as a function of (a) the jet Er and 
(b) the number of SVX tracks in a jet. 



Figure O shows that the tagging probability parametrization derived using jets with well 
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measured energies works well for all jets. 
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FIG. 19. Pseudo-rapidity distributions of all jets tagged by SECVTX (•) are compared to the 
prediction derived using only jets away from calorimeter cracks (shaded histogram) in JET 20 and 
JET 50 data, (a) and (c) are negative tags; (b) and (d) are positive tags. 



Since the heavy flavor contribution to negative tags is expected to be small, the number 
of tags due to heavy flavor in a given bin of the tagging probability matrix is estimated 
fairly well by the difference, P — N, between the numbers of positive (P) and negative (N) 
tags in this bin. In simulated jets with heavy flavor, we measure the ratio, R = p^y, as a 
function of the jet transverse energy. We measure this ratio separately for jets which contain 
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only one hadron with heavy flavor (Ri) and for jets which contain two hadrons with heavy 
flavor (i?2)- The following empirical parametrization provides a good description of R for 
jets containing b as well as c-hadrons: 



Ri{E 7 



0.0088 + 0.000158 • E T for SECVTX 
0.039 + 0.00117 -E T forJPB 



i?2 (E[ 



T 



0.075 + 0.000158 • E T for SECVTX 
0.14 + 0.00117 ■ Et forJPB 

With this parameterization we construct the mistag probability matrix by correcting 
each bin of the negative tagging probability matrix by the factors: 

N - (P - N) • Rx(E T ) for jets with E T > 30 GeV 
N - (P - N) • R 2 (E T ) for jets with E T < 30 GeV 



The fraction of negative tags contributed by heavy flavors is shown in Table [XVI. 1 1 
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TABLE XVIII. Fraction of negative tags (%) due to heavy flavor as a function of the Et of the 
jet. 



Jet Et (GeV) 
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In the generic-jet samples used to derive the mistag matrices, approximately 70% of the 
events contain additional interactions. The rate of multiple interactions is different in other 
samples, e.g. W+ multi-jet events where we require an isolated primary lepton. The negative 
tagging rate in the generic-jet data depends on the number of additional interactions. 

Figure EUI shows the relative negative tagging probability, normalized to the average, 



as a function of the sum of the transverse momenta of all tracks associated with additional 
vertices displaced by more than 5 cm from the primary vertex Y^Pr- Accordingly, the mistag 
rate is parametrized with the additional empirical function for both SECVTX and JPB: 



0.8 + 0.0128 • EPt for EPt < 60 GeV/c 
1.57 for Y.Pt > 60 GeV/c 
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FIG. 20. Yield of the negative tagging probability as a function of J^Pt f° r ( a ) SECVTX and 
(b) JPB. The solid line represents an empirical parametrization described in the text. 
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A. Check of the SECVTX mistag parametrization 

In this section, we test the capability of our model to predict the rate of negative tags 
in all available generic-jet samples. 
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FIG. 21. E T distributions of jets with SECVTX tags in the JET 100 sample. On (a), observed 
positive tags (histogram) are compared to the predicted mistags (shaded histograms). On (b), 
observed negative tags (•) are compared to the predicted mistags (shaded histogram). On (c), 
predicted mistags (histogram) are compared to the predicted heavy flavor contribution to the 
negative tags (shaded histograms). On (d), observed negative tags (•) are compared to the sum of 
the predicted mistags and heavy flavor contribution to the negative tags (shaded histogram). 
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Figure ^T] serves to illustrate the procedure followed to predict the rates of negative tags. 
They are evaluated as the sum of the mistags plus the heavy flavor contribution using the 
Ri and R2 parametrizations derived in the previous section. This procedure requires the 
knowledge of the fraction of quark and gluon jets as a function of jet-^r in each data sample 
(literally, we need to know the fraction of jets containing one or two hadrons with heavy 
flavor). In the JET 20, JET 50, JET 70, JET 100, ££ T 175 and £ E T 300 samples we 
make the assumption, corroborated by the corresponding simulations, that all jets below 
trigger threshold are gluon jets and all jets above trigger threshold are quark jets. 



Figure ^Tj(a) shows the rate of observed positive tags and predicted mistags as a function 
of the jet E T . Figure 0(b) compares rates of negative tags to the predicted mistags. The 
mistag rate does not include any heavy flavor contribution and is lower than the observed 
rate of negative tags. Figure |2j](c) compares the rate of mistags and the heavy flavor 
contribution to the negative tags obtained by multiplying the difference between positive 



tags and predicted mistags in Figure ^T|(a) by Ri (R 2 ) if the jet-E T is above (below) the 
trigger threshold. Figure |2T](d) compares the observed and predicted yield of negative tags. 
The predicted yield is derived by adding the two distributions shown in Figure |2~T|(c). 

Following the same procedure, comparisons between the corrected jet E? distributions 
of observed and predicted negative SECVTX tags are shown in Figures ^2] to |24|. In the 
case of the Et 125 4CL sample, the ratio of quark to gluon jets (1/6, independent of E T ) 
is evaluated using the corresponding Herwig simulation. In the inclusive photon sample, 
we use only the R\ parametrization as the simulation shows that tagged jets are mostly 
contributed by the 7c Compton production. 

The inclusive low-pr electron sample, used to measure the tagging efficiency scale factor, 
is also a good sample to test the validity of the R\ and R2 parametrizations because it 
is enriched in heavy flavor content. We compare rates of observed and predicted negative 
tags both in the data and the corresponding Herwig simulation. The fraction of gluon jets 
in the simulation is taken from Table |X1V| . In the data, this fraction is increased by 40% 
according to the calibration of the Herwig simulation performed in Section [XI]. Comparisons 
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between observed and predicted rates of negative tags are shown in Figure |25] for the data 
and Figure for the simulation. 
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FIG. 22. Transverse energy distributions of jets with negative SECVTX tags. The 4 data 
samples were used for the construction of the mistag probability matrix. 
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FIG. 23. Transverse energy distributions of jets with negative SECVTX tags in the Et 175 (a) 
and Et 125 4CL (b) samples, which were not used for the construction of the mistag probability 
matrix. 
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FIG. 24. Transverse energy distributions of jets with negative SECVTX tags in the isolated 
photon sample. 
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FIG. 25. Transverse energy distributions in the inclusive electron data. In (a) e-jets with a 
negative SECVTX tag; (b) a-jets with a negative SECVTX tag in events where the e-jet is tagged 
by SECVTX; (c) e-jets with a negative JPB tag; (d) a-jets with a negative JPB tag in events where 
the e-jet is tagged by SECVTX. 
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FIG. 26. Transverse energy distributions in the inclusive electron simulation. In (a) e-jets with 
negative SECVTX tags; (b) a-jets with negative SECVTX tag; (c) e-jets with negative JPB tags; 
(d) a-jets with negative JPB tag. 



Table |XIX| summarizes the rates of observed and predicted negative SECVTX tags in all 
generic-jet samples. Based on the observed agreement a 10% systematic error is assigned to 
the estimate of the SECVTX mistag probability. 
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TABLE XIX. Numbers of observed positive and negative SECVTX tags in all generic-jet sam- 



pies. The method for 


predicting the 


■ number of negative tags, Pn, is exp 


lained in the text. 




Samples used in the mistag parametrization 


Sample 


Pos. tag 


;s Neg. tags 


Mistags 


Pn 


JET 20 


4731 


699 


652 


722 


JET 50 


6874 


1648 


1426 


1695 


JET 70 


7758 


2248 


1858 


2192 


JET 100 


8335 


2723 


2385 


2756 


£ E T 300 


1507 


501 


438 


521 


Independent samples 


£ E T 175 


3790 


947 


675 


908 


£ E T 125 4CL 


5637 


1203 


897 


1249 


Isolated 7 


284 


29 


35 


40 
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B. Check of the JPB mistag parametrization 



We follow the same procedure of the previous section to test the parametrization of the 



mistag rate of jet-probability. Figures trough ^ compare E T distributions of observed 
and predicted jets with negative JPB tags for all generic-jet samples. Rates of JPB tags are 
summarized in Table XX. As before, by comparing the observed and predicted number of 
negative tags, we assign a 10% systematic error to the parametrization of the JPB mistags. 
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FIG. 27. Transverse energy distributions of jets with a negative JPB tag. The 4 samples were 
used for the construction of the mistag probability matrix. 
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FIG. 29. Transverse energy distributions of jets with negative JPB tags in the isolated photon 
sample. 
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TABLE XX. Numbers of observed positive and negative JPB tags in all generic-jet samples. 
The method for predicting the number of negative tags, Pn, is explained in the text. 



Samples used in the parametrization 



Sample 


Pos. tags 


Neg. tags 


Mistags 


Pn 


JET 20 


8418 


3414 


2919 


3421 


JET 50 


12124 


5970 


4948 


6156 


JET 70 


13254 


7567 


6020 


7437 


JET 100 


14528 


8827 


7010 


8721 


£ E T 300 


2712 


1581 


1162 


1566 


Independent samples 


£ E T 175 


6217 


3235 


2227 


3069 


£ E T 125 4CL 


9283 


4407 


3166 


4481 


Isolated 7 


537 


179 


176 


209 
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XI. CALIBRATION OF THE FRACTION OF W + JET EVENTS WITH HEAVY 

FLAVOR 



Wbb and Wcc events are produced through the so-called gluon splitting process, where 
a final state gluon branches into a heavy quark pair. In this analysis the fraction of W+ 
jet events containing heavy flavor is estimated using the Herwig generator. The uncertainty 
in the rate of gluons splitting into heavy quarks based on the parton shower approach is 
estimated to be approximately 40% in Ref. JJTJ and approximately 25% in Ref. Because 
of this large uncertainty we calibrate the gluon splitting cross section modeled with Herwig 
using generic-jet data. Heavy flavor in generic-jet data stems from three primary sources: 
(1) direct production (e.g. gg — > bb); (2) flavor excitation (e.g. gb — > gb); and (3) gluon 
splitting. The calibration of the simulation package is performed by tuning the various 
cross sections calculated by Herwig to reproduce the tagging rate observed in the JET 50 
and JET 100 data. In these samples, the gluon splitting contribution is comparable to the 
other production mechanisms. In the JET 20 simulation, the gluon splitting contribution is 
negligible; we compare observed and predicted rates of tags in this sample using the tuned 
simulation as a check that we disentangled correctly the different heavy flavor production 
mechanisms. 

In each generic-jet sample we count the number of SECVTX tags in taggable jets. 
Mistags are evaluated using the mistag probability evaluated in Section |X|. 

The simulated samples (corresponding to the JET 20, JET 50 and JET 100 data) are 
generated using the option 1500 of Herwig and requiring hard scattering partons with \r}\ < 
4.5 and p^' m > 10, 40 and 80 GeV/c respectively |23|. We use the MRS(G) set of structure 
functions pj. Generated events are simulated with the standard package discussed in 



Section |VII| . As in the data, we select events containing at least one jet above the trigger 
threshold. 

In the simulation a jet is classified as a b or a c-jet if it contains a b or a c-hadron in a cone 



S9 



of radius 0.4 around its axis. Hadrons with heavy flavor resulting from the fragmentation 
of one of the hard scattering partons are indicative of direct production or flavor excitation 
(if one of the incoming partons of the hard scattering has heavy flavor we attribute the 
process to flavor excitation; in this case a second hadron of the same flavor is produced by 
the backward-evolution of the structure functions). All pairs of hadrons with heavy flavor 
of the same type which do not come from the hadronization of the hard scattering partons 
are attributed to gluon splitting. Table [XXI| lists the rate of jets containing heavy flavor per 



event in the simulated JET 50 and JET 100 samples. 

TABLE XXI. Average numbers of jets containing heavy flavor per event in the JET 50 and JET 
100 samples generated with Herwig, split by flavor type and production mechanism. 





direct production + flavor excitation 




gluon splitting 






Sample 


6-jets c-jets 


9 


^bb g^ 


cc 


Total 


JET 50 


2.14 x 10~ 2 3.04 x 10~ 2 


1.67 


x 10~ 2 3.79 x 


10~ 2 


10.64 x 10~ 2 


JET 100 


2.15 x 10" 2 2.89 x 10~ 2 


2.58 


x 10~ 2 5.73 x 


10~ 2 


13.35 x 10~ 2 
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In the data, in order to distinguish gluon splitting from the rest of the heavy flavor 
production we use intuitive kinematical differences. Jets from heavy flavor direct production 
are expected to be produced back-to-back and are more likely to produce double tags. In 
events produced by heavy flavor excitation, jets produced by the backward-evolution of the 
structure functions tend to be at large pseudo-rapidities and out of the SVX acceptance. On 
the other hand, gluon splitting produces pairs of jets with heavy flavor at small separation 



AR = J(A(j)) 2 + (A77) 2 . Most of the time the two hadrons with heavy flavor produced by 



gluon splitting reside in the same jet. Figure [3(J shows distributions of the distance between 
two 6-jets for the different production mechanisms in the simulated sample. In addition, the 
relative gluon splitting contribution increases with the jet multiplicity. 
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FIG. 30. Distributions of the distance AR between two 6-jets tagged by SECVTX in JET 50 
simulated events contributed by (a) direct production and flavor excitation or (b) gluon splitting, 
(c) and (d) are the distributions of the distance between a 6-jet tagged by SECVTX and the closest 
jet in the event with Et > 10 GeV. 



This motivates us to compare data and simulation in the following classes of SECVTX 
tags: 

• (1): number of tagged jets per event with at least one taggable jet. 

• (2): number of tagged jets per event with at least one taggable jet and with three or 
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more jets with Et > 15 GeV and \rj\ < 2. 

• (3): twice the number of events with two tagged jets per event with two or more 
taggable jets. 

We also compare the data to the simulation for: 

• (4): the fraction of (1) in which the tagged jet has a companion jet with Et > 10 GeV 
in a cone of radius 1.2 around its axis. 

• (5): the fraction of events with double tags where the two tagged jets are at a distance 
AR <1.2. 



Table XXII lists the yields of tags in the data and in the simulation. 
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TABLE XXII. Yields (xlO -3 ) of SECVTX tags in generic-jet data and in simulated samples 
generated with Herwig. Rows 1, 2 and 3 represent the average number of tags per event; rows 4 
and 5 represent the fraction of 1 and 3, respectively. Rates of simulated tags are not yet corrected 
for the tagging efficiency scale factor measured in Section pX] , 

JET 50 



direct production flavor excitation gluon splitting 



Class 


Data 


6-jets 


c-jets 


6-jets 


c-jets 


g^bb 


9 -> cc 


1 

J. 


qa 90+1 0^ 


2.90±0.11 


0.72±0.02 


6.53±0.17 


2.63±0.11 


7.37±0.18 


4.17±0.14 





ao nn_Li Q7 

4:O.UUXl.O / 


2.31±0.16 


0.53±0.08 


6.36±0.26 


2.26±0.16 


9.71±0.33 


5.32±0.24 


3 


7.50±0.65 


2.00±0.18 


0.16±0.04 


0.94±0.13 


0.07±0.03 


0.65±0.10 


0.09±0.02 


4 


5.60±0.38 


0.23±0.03 


0.04±0.01 


0.71±0.06 


0.25±0.03 


2.17±0.10 


0.86±0.06 


5 


0.58±0.08 


0.00±0.08 


0.00 


0.08±0.03 


0.00 


0.26±0.05 


0.00 








JET 100 












direct production 


flavor excitation 


gluon splitting 


Class 


Data 


6-jets 


c-jets 


6-jets 


c-jets 


g^bb 


9 -> cc 


1 


42.05±1.84 


4.31±0.22 


1.23±0.12 


5.57±0.25 


2.24±0.16 


11.85±0.37 


6.88±0.28 


2 


51.50±2.04 


3.51±0.27 


0.86±0.13 


5.71±0.35 


1.97±0.20 


15.06±0.56 


8.40±0.42 


3 


15.50±0.92 


2.68±0.29 


0.26±0.09 


1.08±0.18 


0.05±0.03 


1.42±0.21 


0.10±0.05 


4 


6.36±0.41 


0.64±0.09 


0.09±0.03 


0.97±0.10 


0.31±0.06 


5.03±0.24 


2.10±0.15 


5 


l.lOiO.ll 


0.00±0.03 


0.00 


0.06±0.03 


0.00 


0.54±0.09 


0.00 
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In the simulation, one notes that after tagging with SECVTX the contribution of c-jets 
is reduced by more than a factor of four and becomes negligible in events with double tags. 
However, the ratio of double to single SECVTX tags does not discriminate between bb and 
cc production for this ratio is similarly small for bb production through flavor excitation and 
gluon splitting. 

We discriminate the flavor type with the additional comparison of rates of JPB tags 
(JPB has about the same tagging efficiency of SECVTX for fc-jets and is more than twice 
as efficient for tagging c-jets). Since we use JPB tags only to disentangle between b and 
c-production, we compare data and simulations in only two classes of JPB tags: 

• (6): number of tagged jets per event with at least one taggable jet. 

• (7): twice the number of events with two tagged jets per event with two or more 
taggable jets. 



Table [XX111| lists the yields of JPB tags in the data and in the simulation. 

TABLE XXIII. Fractions (xlO -3 ) of JPB tags per event in generic-jet data and in simulated 
samples generated with Herwig. Fractions of tags are not yet corrected for the tagging efficiency 



scale factor measured in Section IX 







JET 50 








direct production flavor excitation 


gluon splitting 


Class 


Data 


6-jets c-jets 6-jets c-jets 


g — » bb g — > cc 


6 


45.20±3.19 


3.84±0.13 1.87±0.09 7.97±0.19 6.28±0.17 


9.11±0.21 8.67±0.20 


7 


4.75±0.28 


1.62±0.12 0.26±0.05 0.81±0.09 0.23±0.05 


0.89±0.09 0.52±0.06 






JET 100 








direct production flavor excitation 


gluon splitting 


Class 


Data 


6-jets c-jets 6-jets c-jets 


g — » bb g — > cc 


6 


53.07±5.09 


5.72±0.26 2.66±0.18 6.86±0.29 5.69±0.26 


14.22±0.42 13.13±0.40 


7 


5.50±0.34 


2.11±0.19 0.39±0.08 0.78±0.11 0.25±0.06 


1.69±0.17 1.06±0.13 
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We proceed to fit the data with the simulation in order to extract the rate of g — > bb and 
g — ► cc. When fitting the simulation to the data, the yield of simulated SECVTX and JPB 
tags is corrected for the tagging efficiency scale factors measured in Section ITX|. The 10% 



uncertainty in the scale factor determination is included in the error of the simulated rates 
of tags. In the fit, we also compare five distributions in each generic-jet sample and in the 
corresponding simulation: 

1. the yield of the fraction of SECVTX tags per taggable jet as a function of the jet-Er- 

2. The distributions of the distance AR between a jet tagged by SECVTX and a com- 
panion jet as defined above. 

3. The distributions of the distance AR between a jet tagged by JPB and a companion 
jet as defined above. 

4. The distributions of the distance AR between two jets tagged by SECVTX. 

5. The distributions of the distance AR between two jets tagged by JPB. 

In the comparison, the area of each distribution is normalized to unity. For each distribution 
we compute a reduced % 2 

2 1 * (d(i) - sdji)) 2 
Xd ~ N fr{ ed{%Y + esd{i) 2 

where N is the number of bins in each distribution, d(i) and sd(i) are the contents of the 

bin i of the distribution in the data and the simulation respectively, with ed(i) and esd(i) 

their errors. The simulated jet-Er distributions have a systematic uncertainty due to the 

trigger simulation which is cumbersome to account for in the fit. Simulated distribution of 

distances between tagged jets have unaccounted systematic uncertainties due to how well 

the parton shower generator models gluon splitting at distances AR > 1.2. We use the 

reduced xd to diminish the importance of these comparisons with respect to the classes of 

absolute tagging rates. The data are fitted to the simulation using a minimum \ 2 method. 

We minimize the function 
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2 J ™ f (F(j) - SF(j)) 2 * fe 
where the index k runs over the 5 kinematic distributions described in the previous para- 



graph, F(j) are the yields of tags observed in the data for the classes listed in Tables [XXII 
and pCXmi , and 

SF(f) = j2P(n)-C H (j,n)*SF a 

n=l 

is the corresponding yield of simulated tags. The contributions Cn{j,n) of different flavor 
types and production mechanisms, as listed in Tables |X.X.1J| and |XX111| , are weighted with 



the fit parameters P(n). SF is the tagging efficiency scale factor and a=0 for c-jet, 1 for 
events with one tagged 6-jet and 2 for events with two tagged 6-jets. 

In the fit, the b-to-c ratio for direct production is constrained to the default value with 
a 14% Gaussian error. Option 1500 of Herwig evaluates the direct production cross section 
of massless quarks. The 14% uncertainty accounts for having neglected the quark masses 
(estimated using option 1700 of Herwig) and for the uncertainty in the fragmentation process 
(estimated using the Pythia generator). 

The 6-to-c ratio for flavor excitation is also constrained to the default value with a 28% 
Gaussian error. This uncertainty accounts for the largest variation of this ratio observed 
using a wide range of structure functions in the PDF library |J3 . 



The ratio of the g — > bb to g — > cc is also constrained to the default value with a 28% 
Gaussian error. The uncertainty accounts for a ±0.5 GeV change of the b and c-quark 
masses around the default value. 

The fit has 21 degrees of freedom and yields a \ 2 °f 22. The fit results are shown in 



Table [XXIV| . The weights of the gluon splitting cross sections will be used to rescale the 
fraction of W+ jet events with heavy flavor predicted by Herwig. 

Figure ^T] compares the E T distributions of tagged jets in the data and in the fitted 



simulation. Similarly, Figures [32] and |33] compare distributions of distances between tagged 
jets. 
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FIG. 31. Fractions of taggable jets with a SECVTX tag as a function of jet Et in the data and 
in the fitted simulation. The distributions of the data and the fitted simulation are normalized to 
unit area. 
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FIG. 32. Distributions of the distance AR\ between a jet tagged by SECVTX tag and the closest 
jet in the event and of the distance AR2 between two jets tagged by SECVTX. The distributions 
of the data and the fitted simulation are normalized to unit area. 
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AR X AR 2 
FIG. 33. Distributions of the distance AR\ between a jet with a JPB tag and the closest jet in 
the event and of the distance AR2 between two jets with a JPB tag. The distributions of the data 
and the fitted simulation are normalized to unit area. 
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TABLE XXIV. Result of the fit of the Herwig simulation to the JET 50 and JET 100 data (see 
text). 

Process Cross section weight 

b direct production+flavor excitation 1.09 ± 0.15 

g -► bb 1.40 ± 0.19 

b Total 1.22 ± 0.12 

c direct production+flavor excitation 1.12 ± 0.28 

g^cc 1.35 ± 0.36 

c Total 1.25 ± 0.20 

b + c direct production+flavor excitation 1.11 ± 0.16 

g^bb,cc 1.36 + 0.22 

6 + c Total 1.24 + 0.12 
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Table |XXV| compares rates of tags in generic-jet data and in the Herwig simulation 
calibrated according to Table [X.X1V| . The JET 20 sample was not used to calibrate the 
simulation package. Similarly, the SLT algorithm was not used in the Herwig calibration. 
The comparison of the number of SLT tags in the data and simulation serves to check 
independently the calibration of the Herwig production cross section and the SLT tagging 
efficiency in the simulation. 

TABLE XXV. Number of tagged jets with heavy flavor in generic-jet data and in the calibrated 
Herwig simulation. Data and simulation are normalized to the same number of events. The second 
and third columns list the number of tags and removed mistags in the data. 



Tag type 
SECVTX 
JPB 
SLT 



JET 20 (194009 events) 

Tags Mistags Data 

4674 616 4058±92 

8343 2801 5542±295 

4994 3962 1032±402 



Simulation 
4052±143 
5573±173 
826±122 



Tag type 
SECVTX 
JPB 
SLT 



JET 50 (151270 events) 
Tags Mistags Data 

6536 1360 5176±158 

11533 4700 6833±482 

6408 5241 1167L530 



Simulation 
5314±142 
6740±171 
1116L111 



Tag type 
SECVTX 
JPB 
SLT 



JET 100 (129434 events) 

Tags Mistags Data 

7682 2227 5455±239 

13365 6494 6871±659 

7483 6367 1116±642 



Simulation 
5889±176 
7263±202 
1160±168 



102 



XII. COMPOSITION OF THE W+ >1 JET SAMPLE 

The background to the tt production is determined using the data or the simulation 
calibrated as described in Sections [IXJ and |XJ. The tt production cross-section is determined 
by attributing the excess of tagged W+ > 3 jet events to tt production. W + 1 and W + 2 jet 
events provide a check of the background calculation. The evaluation of the backgrounds to 
tt events is detailed in the subsections A-H. The results of these background determinations 
are listed in the tables in Section |XV| where the cross section is calculated. The following 
two Sections |X111| and IXTVI provide checks of the background estimates. 



A. non-VF background 



As in previous analyses fl3l |, the background from non-iy sources, including bb pro- 



duction, is determined directly from the data by studying the isolation of primary lepton 
candidates in the low (# T < 10 GeV) and in the high (# T > 20 GeV) t T region. The 
number of non-W events in each jet-bin is evaluated as 

N A 

Nnon-W — Nc X — — 

where Na, Nb and Nc are the number of events in regions A, B and C of Figure The 
corresponding number of tagged events is 

where P tag is the tagging probability measured in region A. The yield of P tag as a function 



of the lepton isolation is shown in Figure 35 
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FIG. 34. Distributions of the primary lepton isolation vs. $ T in W+ >1 jet candidate events. 
The three regions A, B, C are used to evaluate the non-VF contribution in the region D, which 
defines the W signal. 



104 




0.1 0.2 0.3 0.4 0.5 0.1 0.2 0.3 0.4 0.5 



Isolation Isolation 



FIG. 35. The tagging probability as a function of the isolation of the primary lepton. 
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B. Z —> t + t events 

The Z — > t + t~ contribution is estimated using the Pythia generator (option MSEL=13). 
The simulation is normalized to the same number of Z — > e + e~ events observed in 

the data. 



C. Single top production 

The single top contribution via the W — gluon fusion channel is estimated with Herwig 
using the process 2000. The single top production for the annihilation process qq — > W* — > tb 
is estimated using the Pythia generator (option MSEL=12). We use the cross sections 



o"vk- 9 =1-5±0.4 pb for W — g fusion derived using the NLO calculation in Ref. |35| and 



<T w »_ >t 5=0.74±0.05 pb for the annihilation process [|35|. 



D. Diboson production 

The contribution of the ZZ, WZ and WW production is estimated using the Pythia 
generator (options MSEL=15 and ISUB=22, 23 and 25, respectively). We use the cross 
sections a(WW)=9.5±0.7 pb, a(W Z) =2. 6±0. 34 pb and <r(ZZ) = 1.0±0.2 pb |§. 



E. Mistags 

The SECVTX and SLT mistags are calculated weighting each jet in the W sample 
with the mistag probability matrices derived in Section |X| and Section [VIII C 1| , respectively. 
The reevaluation of the SECVTX mistag matrix has resulted in a reduced estimate of this 
background in the signal region by (50 ±5)% compared with the previous estimates of 
Ref. 0,| . 

For the jet-probability algorithm, each simulated background also includes the contribu- 
tion of mistags. The number of JPB mistags is evaluated only for the fraction of W+ jet 
events which is not simulated, i.e. W+ jet direct production without heavy flavor. 
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F. The W + bb and W + cc contribution 



We use the Herwig generator (process 2100 with hard scattering p™ m > 10 GeV/c) to 
estimate the fraction of W+ > n jet events, F®, in which only one jet contains hadrons 
with heavy flavor resulting from gluon splitting (a refers to the flavor type). The fraction 
of W+ > 2 jet events, F£, in which two different jets contain hadrons with heavy flavor is 
calculated using the Vecbos generator (see Section |VTp. The fractions of W+ jet events with 
heavy flavor content are listed in Table [XXVI. We use the Herwig and Vecbos simulations 



also to determine the efficiency for finding events with one or two tagged jets, as listed in 



Table XXVII. 



It follows that the number of tagged Wbb and Wcc events is 



where Nw is the number of W events in the data after removing the predicted number of 
noia-W, di-bosons, single top, unidentified Z and tt events. 
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TABLE XXVI. Fraction of W+ > 1 jet events with heavy flavor jets as a function of the jet 
multiplicity. 







Wbb 






Wcc 




Sample 


Ft 


(%) 


F* (%) 


F{ 


(%) 


F| (%) 


W + ljet 


0.80±0.11 




2.01+0.54 




W + 2 jet 


1.28±0.18 


1.20±0.38 


3.73+1.00 


1.40+0.52 


W + 3 jet 


1.88±0.31 


1.90±0.62 


5.31±1.48 


2.30+0.91 


W+ > 4 jet 


3.54±1.06 


2.40+0.77 


6.08±2.45 


3.00±1.13 


TABLE XXVII. Tagging efficiencies (e) in Wbb and Wcc simulated events. 








SECVTX 








Sample 
W + ljet 


e ltag (%) 

24.6±0.8 


e ltag (%) 


e 2tag (%) 


e ltag (%) 

4.56+0.29 


e ltag (%) 


e 2tag (%) 


W + 2jet 


21.6±1.7 


45.8+1.8 


10.6±1.2 


3.59±0.49 


11.7±1.1 


0.4+0.2 


W+ > 3jet 


20.6±4.4 


46.8+4.0 


10.7±2.8 
JPB 


3.59±1.26 


14.3±2.3 


0.0+0.0 


Sample 
W + ljet 


(%) 
23.8±0.7 


£ ltag (%) 


4 & ag (%) 


e ltag (%) 

9.8+0.4 


£ ltag (%) 


4% (%) 


W + 2]et 


20.3±1.4 


40.7+1.5 


10.0±0.9 


7.8±0.7 


25.0±1.4 


2.8+0.5 


W+ > 3jet 


21.7±3.8 


43.2+3.4 


9.3±1.9 
SLT 


13.0±2.2 


25.6+2.9 


1.7±0.8 


Sample 
W + ljet 


(%) 
7.7±0.9 


^ag (%) 


4l g (%) 


(%) 

3.7+0.5 


^ag (%) 


4% (%) 


W + 2]et 


6.9+1.2 


13.2+1.7 


0.6+0.3 


3.7±0.6 


6.2±1.0 


0.1+0.1 


W+ > 3jet 


7.1+2.6 


9.6+2.3 


0.5+0.5 


5.2±1.6 


8.0±2.0 


0.0+0.0 
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G. The Wc contribution 



The fraction F Wc of gs — > VTc and g<i — > Wc events is evaluated using the Herwig 
W+ 1 jet simulation and is shown in Table XX VIII . The estimated uncertainty on Fwc 
is dominated by the uncertainty in the strange sea content in the proton, computed by 
examining a wide range of different structure functions in Ref. 0. The tagging efficiencies 
for this process are listed in Table [XXIX. 
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TABLE XXVIII. Fractions of Wc events. 



Sample 
W + ljet 
W + 2)et 
W + 3 jet 
W+ > 4 jet 



Fwc (%) 
4.8±1.4 
7.2±2.2 
7.5±2.3 
7.5±2.3 



TABLE XXIX. Tagging efficiencies in VFc events. 

Sample e s w E c c (%) £ j%* (%) egjf (%) 

W + ljet 4.1±0.4 8.7±0.4 3.3±0.4 

W + 2]et 4.2±0.6 10.8+1.0 5.2±0.7 

W+ > 3 jet 4.5±0.6 16.7+2.9 6.9±2.0 
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H. Direct production of Z+ jet with heavy flavor 

We use the Pythia generator (option MSEL=13) to estimate the fraction of unidentified 
Z+ jet events passing our selection. The simulation is normalized to the number of Z — > ti 
observed in the data. The fraction of Z+ jet events containing heavy flavor is determined 
by rescaling the Herwig prediction for the fraction of W+ jet events with heavy flavor. The 
rescaling factor is derived by comparing W+ 1 jet and Z+ 1 jet events generated with 
Pythia (option MSEL=14 and 13, respectively). For the same number of generated events, 
we find 0.3 ± 0.15 Zc events for every We event, 1 ± 0.3 Zee events for every Wee event, 
and 2.0 ± 0.5 Zbb events for every Wbb event. 



XIII. CHECK OF THE BACKGROUND CALCULATION USING THE Z+ > 1 

JET SAMPLE 



The production mechanisms of W and Z bosons in association with jets are very similar. 
The contribution of ti events to the Z+ jet events is negligible. This sample provides a good 
benchmark for our background calculation. The selection of the Z+ >1 jet event sample is 
described in Section IV A . Table XXX shows the predicted composition of the Z+ >1 jet 
sample before tagging. In this Table the number of Z+ jet events is derived from the data by 
subtracting the WW, WZ, ZZ, ti, and single top contribution. The measured and predicted 
rates of events with SECVTX, JPB and SLT tags are shown in Tables |XXXI| to |XXXIII| . The 
product of the four Poisson probabilities, one per jet bin, for the observed numbers given 



the prediction is P = 1.2 X 10^ for Table [XXXg , P = 2.1 x 10~ 4 for Table PCXXIH and 
Po = 1.0 x 10~ 3 for Table [XXXHIl With a Monte Carlo simulation, in which we fluctuate 



the predicted rates by their uncertainty according to a gaussian distribution, we estimate 
that the likelihood of observing a probability no larger than P is 33.8% for events with 
SECVTX tags, 17.9% for events with JPB tags and 41.1% for events with SLT tags. In Z+ 
jet events the background prediction agrees with the data. 
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TABLE XXX. Composition of the Z+ >1 jet sample before tagging. We use (J t j= 5 pb from 
Ref 1. 



Source 


Z + ljet 


Z + 2 jet 


Z + 3jet 


Z+ > 4 jet 


Data 


1148 


159 


16 


4 


WW 


0.8±0.2 


0.2±0.1 


O.OiO.O 


O.OiO.O 


WZ 


2.2±0.5 


1.7±0.4 


0.3±0.1 


O.liO.O 


zz 


1.2±0.3 


1.6±0.4 


0.3±0.1 


O.OiO.O 


Zc 


16.5±4.9 


3.3±1.0 


0.3±0.1 


O.liO.O 


Zbb 


18.3±2.5 


7.6±1.3 


1.1±0.2 


0.4±0.1 


Zee 


23.0±6.1 


7.9±1.7 


1.1±0.3 


0.3±0.1 


Z+ jet without h.f. 


1085.3±8.3 


135.3±2.5 


12.2±0.4 


2.9±0.1 


Single top 


O.ldbO.O 


O.OiO.O 


O.OiO.O 


O.OiO.O 


it 


0.6±0.1 


1.4±0.3 


0.5±0.1 


0.2±0.0 
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TABLE XXXI. Summary of observed and predicted number of Z events with one (ST) and two 
(DT) SECVTX tags. 



Source 


Z + ljet 


Z + 2jet 


Z + 3 jet 


Z+ > 4 jet 


Mistags 


1.27±0.13 


0.34±0.03 


0.08±0.01 


0.01±0.01 


WW, wz, zz 


0.09±0.03 


0.18±0.05 


0.03±0.01 


O.OOiO.OO 


Zc 


0.67±0.21 


0.15±0.05 


0.02±0.00 


O.OOiO.OO 


Zee, Zbb (ST) 


5.56±0.70 


2.59±0.46 


0.40±0.08 


0.14±0.03 


Zee, Zbb (DT) 




0.39±0.13 


0.06±0.03 


0.02±0.01 


Single top 


0.01±0.01 


0.00±0.00 


0.00±0.00 


0.00±0.00 


It [p 1 ) 


U.zzztU.UD 


n a a 4~n no 


U.zu±U.Uo 


U.UozhU.Ui 


tt (DT) 




0.23±0.06 


0.07±0.02 


0.03±0.01 


Prediction (ST) 


7.83±0.74 


3.70±0.47 


0.73±0.10 


0.20±0.03 


Prediction (DT) 




0.62±0.14 


0.13±0.03 


0.04±0.01 


Data (ST) 


10 


3 





1 


Data with (DT) 




2 
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TABLE XXXII. Summary of observed and predicted number of Z events with one (ST) and 
two (DT) JPB tags. 



Source 


Z + ljet 


Z + 2jet 


Z + 3 jet 


Z+ > 4jet 


Mistags 


5.65±0.57 


1.51±0.15 


0.34±0.04 


0.05±0.01 


WW, wz, zz 


0.13±0.03 


0.24±0.06 


0.02±0.01 


0.02±0.01 


Zc 


1.39±0.44 


0.35±0.11 


0.05±0.02 


O.OliO.OO 


Zee, Zbb (ST) 


6.63± 0.87 


2.85± 0.45 


0.50±0.09 


0.17±0.03 


Zee, Zbb (DT) 




0.42±0.13 


0.06±0.02 


0.02±0.01 


Single top 


0.01±0.00 


O.OOiO.OO 


O.OOiO.OO 


O.OOiO.OO 


It (p 1 ) 


n i 74~n no 
U.l l ±u.uz 


U.4o±U.UD 


u.iy±u.uo 


U.U4±U.U1 


tt (DT) 




0.17±0.02 


0.06±0.01 


0.03±0.00 


Prediction (ST) 


13.98±1.13 


5.37±0.49 


1.11±0.10 


0.30±0.04 


Prediction (DT) 




0.59±0.13 


0.12±0.02 


0.05±0.01 


Data (ST) 


11 


5 


1 


2 


Data (DT) 
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TABLE XXXIII. Summary of observed and predicted number of Z events with one (ST) and 
two (DT) SLT tags. 



Source 


Z + ljet 


Z + 2 jet 


Z + 3jet 


Z+ > 4 jet 


Mistags 


12.65±1.27 


3.66±0.37 


0.57±0.06 


0.15±0.02 


WW, wz, zz 


0.04±0.02 


0.09±0.03 


O.OlztO.Ol 


O.OliO.01 


Zc 


0.55±0.17 


0.17±0.05 


0.02±0.01 


O.OliO.OO 


Zee, Zbb (ST) 


2.26zt0.36 


1.10±0.19 


0.16zt0.03 


0.06±0.01 


Zee, Zbb (DT) 




0.02zt0.01 


O.OOztO.OO 


O.OOiO.OO 


Single top 


O.OOztO.OO 


O.OOztO.OO 


O.OOztO.OO 


O.OOztO.OO 


tt (ST) 


04±0 00 

V / • V 7 -L — 1 — \ J • V7 V ' 


19±0 02 


08±0 01 


01±0 00 


tt (DT) 




O.OOztO.OO 


O.OOiO.OO 


O.OOztO.OO 


Prediction (ST) 


15.54ztl.33 


5.21±0.42 


0.85±0.07 


0.24zt0.02 


Prediction (DT) 




0.03±0.01 


O.OliO.OO 


O.OOztO.OO 


Data (ST) 


16 


3 





1 


Data (DT) 














XIV. RATES OF NEGATIVE TAGS IN THE W+ > 1 JET SAMPLE 

As shown in Section [X|, the mistag rates plus the estimated heavy flavor contribution to 
the negative tags account for the observed rates of negative tags in all generic-jet data. A 
similar test in the W+ >1 jet sample offers an additional check of the mistag rate predictions 
and a complementary test of the method used to estimate the background contribution to 
the tt signal. 

The rate of negative tags for each process is calculated from the corresponding simulation 
or using the data as we do for positive tags. We use the sample composition before tagging 
listed in Tables |XXXV1| and |XXX Villi for SECVTX and JPB, respectively. Table |XXX1V 



compares numbers of observed and predicted negative SECVTX tags as a function of the 



jet multiplicity. The analogous comparison for negative JPB tags is shown in Table [XXXV . 
Data and predictions agree within the estimated uncertainties. 
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TABLE XXXIV. Summary of the predicted and observed number of W+ jet events with nega- 
tive SECVTX tags. The contribution of each process before tagging is taken from Table XXXVII . 



Source 


W + ljet 


W + 2 jet 


W + 3 jet 


W+ > 4 jet 


Mistags 


10.82± 1.08 


3.80± 0.38 


0.99± 0.10 


0.35± 0.04 


Non-iy 


0.30± 0.15 


0.30± 0.21 


0.00± 0.35 


0.00± 0.14 


WW, wz, zz 


0.00± 0.00 


0.04± 0.04 


0.00± 0.00 


0.00± 0.00 


Single top 


0.07± 0.02 


0.05± 0.02 


0.01± 0.00 


0.00± 0.00 


Wc 


0.69± 0.32 


0.34± 0.15 


0.12± 0.09 


0.02± 0.02 


Wcc (ST) 


0.34± 0.15 


0.18± 0.07 


0.07± 0.05 


0.01± 0.01 


Wcc (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Wbb (ST) 


1.42± 0.26 


0.32± 0.09 


0.08± 0.05 


0.02± 0.02 


Wbb (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Z —* rr 


0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Zc 


0.01± 0.00 


0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Zee (ST) 


0.01± 0.00 


0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Zee (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Zbb (ST) 


0.08± 0.01 


0.02± 0.01 


0.01± 0.01 


0.00± 0.00 


Zbb (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


tt (ST) 


0.01± 0.00 


0.12± 0.03 


0.31± 0.08 


0.27± 0.07 


it (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Prediction (ST) 


13.74± 1.18 


5.18± 0.48 


1.60± 0.39 


0.69± 0.17 


Prediction (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Data (ST) 


19 


7 


2 





Data (DT) 
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TABLE XXXV. Summary of the predicted and observed number of W+ jet events with negative 
JPB tags. The contribution of each process before tagging is taken from Table XXXIX . 



Source 


PF + ljet 


W + 2jet 


W + 3 jet 


W+ > 4 jet 


Mistags 


41.81± 4.24 


12.99± 1.35 


2.25± 0.28 


0.25± 0.19 


Non-TV 


2.74± 0.45 


1.42± 0.43 


0.39± 0.19 


0.16± 0.08 


WW, wz, zz 


0.50± 0.15 


0.74± 0.19 


0.36± 0.13 


0.02± 0.01 


Single top 


0.23± 0.05 


0.34± 0.08 


0.09± 0.03 


0.02± 0.01 


Wc 


9.31± 2.91 


1.82± 0.67 


0.46± 0.21 


0.04± 0.03 


Wcc (ST) 


4.55± 1.27 


0.71± 0.24 


0.26± 0.12 


0.03± 0.02 


Wcc (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Wbb (ST) 


3.14± 0.50 


1.77± 0.36 


0.39± 0.11 


0.06± 0.04 


Wbb (DT) 




0.01± 0.01 


O.Olrb 0.01 


0.00± 0.00 


Z — > TT 


0.44± 0.20 


0.52± 0.21 


0.09± 0.09 


0.00± 0.00 


Zc 


0.08± 0.02 


0.02± 0.01 


0.01± 0.00 


0.00± 0.00 


Zee (ST) 


0.12± 0.04 


0.02± 0.01 


0.01± 0.01 


0.00± 0.00 


Zee (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


Zbb (ST) 


0.17± 0.03 


0.10± 0.02 


0.04± 0.02 


0.01± 0.01 


Zbb (DT) 




0.00± 0.00 


0.00± 0.00 


0.00± 0.00 


it (ST) 


0.12± 0.03 


1.21± 0.26 


2.98± 0.65 


3.35± 0.73 


it (DT) 




0.06± 0.01 


0.09± 0.02 


0.26± 0.06 


Prediction (ST) 


63.21± 5.34 


21.65± 1.67 


7.35± 0.80 


3.93± 0.76 


Prediction (DT) 




0.08± 0.02 


0.10± 0.02 


0.26± 0.06 


Data (ST) 


66 


23 


8 


5 


Data (DT) 




1 





1 



117 



XV. MEASUREMENT OF THE tt PRODUCTION CROSS SECTION 



The tt production cross section is derived using 

Afobs _ Afbkg 
1 * tag 1 * tag 

Attets.gJCdt 

where iV t a g S is the number of tagged W+ > 3 jet events, N^g is the background prediction, 
Ati is the detector acceptance for tt events, e tag is the efficiency for tagging top events and 
/ Cdt = 105.1 ± 4.0 pb _1 is the total integrated luminosity. 

The acceptance for ti events is evaluated with a simulation which uses the Pythia gener- 
ator and is (7.8±1.3)% for a top mass of 175 GeV/c 2 . The 17% systematic error accounts for 
all uncertainties in the simulation which come from the following: lepton identification and 
trigger simulation (± 14%), jet energy scale (± 5%), modeling of initial state gluon radiation 
(± 2%), final state gluon radiation (± 5%), Monte Carlo modeling of the ti production (± 
5%), detector resolution effects (± 2%), and instantaneous luminosity dependence (± 2%). 

The tagging efficiencies are evaluated using the same simulation and are 0.505 ± 0.051, 
0.455 ± 0.046 and 0.157 ± 0.016 for SECVTX, JPB and SLT respectively. 

In the background calculation the rate of W+ jet events with heavy flavor is estimated 
from the number of events due to W+ jet direct production using the fraction of heavy 
flavor determined in Section |X11 F. Therefore the contribution of ti events must be removed 
from the data. This is done by iterating. The ti cross section is estimated from the excess of 
tagged W+ > 3 jet events over the background calculated assuming a t i = 0. The resulting 
ati is used to evaluate the number of ti events before tagging; this contribution is subtracted 
from the data to obtain the contribution of the W+ jet direct production. The amount of 
W+ jet with heavy flavor is recalculated and a t i is updated. The procedure is repeated until 
a t i is stable to within 1%. 

In the sample of 252 W+ > 3 jet events, there are 29 events with at least one jet tagged 
by the SECVTX algorithm. Using the procedure described above, the background estimate 
is 8.0 ± 1.0 events. Assuming that all the excess is due to ti production, the resulting ti 
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cross section is 5.08 ± 1.54 pb (the statistical error is ± 1.30 pb and the systematic ± 0.82 
pb). The estimated break-down of the W+ > 1 jet sample before and after tagging is shown 
in Tables |XXXV1| and [XXXV11 . 

As a cross-check, we calculate a t i using rates of JPB tags. There are 41 events with at 
least one jet tagged by the JPB algorithm with a background of 11.1 ± 1.3 events. The 
observed excess of events yields a t t — 8-02 ± 2.16 pb. The estimated break-down of the 



W+ > 1 jet sample before and after tagging is shown in Tables |XXXVIlT| and |XXXIX 



There are 25 events with at least one jet tagged by the SLT algorithm with a background 
of 13.2 ± 1.2 events. The observed excess of events yields a t t — 9.18 ± 4.26 pb (the statistical 
error is ± 3.89 pb and the systematic ± 1.72 pb). The estimated break-down of the W+ > 



1 jet sample before and after tagging is shown in Tables [XL| and [XLI , 

Because of a small dependence of the acceptance and the tagging efficiencies on the top 
mass, for a ±5 GeV/c 2 change in the top mass the cross sections calculated using SECVTX 
and JPB tags change by ±1.8% and the cross section calculated using SLT tags changes by 
±2.3%. 
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TABLE XXXVI. Composition of the W+ >1 jet sample before tagging using a t j = 5.08 ± 1.54 
pb. 



Source 


W + ljet 


W + 2 jet 


W + 3 jet 


W+ > 4 jet 


Data 


9454 


1370 


198 


54 


Non-W 


560. 1± 14.9 


71. 2± 2.7 


12. 4± 2.0 


5.1± 1.7 


WW 


31. 2± 5.4 


31. 1± 5.4 


5.2± 1.0 


0.8± 0.2 


WZ 


4.4± 0.9 


4.8± 1.0 


0.9± 0.2 


0.1± 0.0 


ZZ 


0.3± 0.1 


0.4± 0.1 


0.1± 0.0 


0.0± 0.0 


Unidentified- Z 


234. 8± 14.5 


38.5± 5.9 


7.9± 2.4 


0.7± 0.7 


Single top 


14.1± 2.1 


7.9± 1.7 


1.7± 0.4 


0.3± 0.1 


Wc 


413.1±123.9 


86.8± 26.1 


11.2± 3.4 


1.9± 0.7 


Wbb 


69.0± 9.5 


29.7± 5.1 


5.7± 1.1 


1.5± 0.5 


Wcc 


173.1± 46.2 


61.9± 13.6 


11.4± 2.6 


2.3± 0.9 


W+ jet without h.f. 


7952.0±133.6 


1027.7± 31.1 


121. 1± 7.7 


19.9± 6.1 


it 


1.8± 0.5 


10.1± 2.8 


20.3± 5.7 


21.3± 5.9 
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TABLE XXXVII. Summary of the predicted and observed number of W events with one (ST) 
or two (DT) SECVTX tags. 



Source 


W + ljet 


W + 2]et 


W + 3jet 


W+ > 4 jet 


Mistags 


10.82 ± 1.08 


3.80 ± 0.38 


0.99 ± 0.10 


0.35 ± 0.04 


Non-W 


8.18 ± 0.78 


1.49 ± 0.47 


0.76 ± 0.38 


0.31 ± 0.16 


WW, wz, zz 


0.52 ± 0.14 


1.38 ± 0.28 


0.40 ± 0.13 


0.00 ± 0.00 


Single top 


1.36 ± 0.35 


2.38 ± 0.54 


0.63 ± 0.14 


0.14 ± 0.03 


Wc 


16.89 ± 5.38 


3.94 ± 1.30 


0.51 ± 0.17 


0.09 ± 0.04 


Wcc (ST) 


7.89 ± 2.17 


3.54 ± 0.88 


0.77 ± 0.25 


0.16 ± 0.07 


Wcc (DT) 




0.06 ± 0.04 


0.00 ± 0.00 


0.00 ± 0.00 


Wbb (ST) 


17.00 ± 2.41 


8.35 ± 1.74 


1.62 ± 0.40 


0.41 ± 0.14 


Wbb (DT) 




1.51 ± 0.52 


0.31 ± 0.13 


0.07 ± 0.03 


Z — > rr 


0.96 ± 0.30 


0.70 ± 0.25 


0.17 ± 0.12 


0.00 ± 0.00 


Zc 


0.14 ± 0.04 


0.03 ± 0.01 


0.01 ± 0.00 


0.00 ± 0.00 


Zee (ST) 


0.22 ± 0.06 


0.10 ± 0.03 


0.04 ± 0.02 


0.00 ± 0.00 


Zee (DT) 




0.00 ± 0.00 


0.00 ± 0.00 


0.00 ± 0.00 


Zbb (ST) 


0.93 ± 0.14 


0.46 ± 0.12 


0.17 ± 0.06 


0.02 ± 0.02 


Zbb (DT) 




0.08 ± 0.03 


0.03 ± 0.02 


0.00 ± 0.00 


Total background (ST) 


64.90 ± 6.45 


26.26 ± 2.51 


6.11 ± 0.68 


1.50 ± 0.23 


lotal background (Dl) 




1.65 ± 0.52 


0.34 ± 0.13 


0.07 ± 0.03 


it (ST) 


0.54 ± 0.14 


3 34 ± 87 


6 76 ± 1 76 


7.42 ± 1.93 


ti (DT) 




0.76 ± 0.20 


2.88 ± 0.75 


3.96 ± 1.03 


ti + background (ST) 


65.44 ± 6.45 


29.61 ± 2.66 


12.87 ± 1.89 


8.92 ± 1.95 


it + background (DT) 




2.41 ± 0.56 


3.23 ± 0.76 


4.03 ± 1.03 


Data (ST) 


66 


35 


10 


11 


Data (DT) 




5 


6 


2 
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TABLE XXXVIII. Composition of the W+ > 1 jet sample before tagging using a t i = 8.02 ± 
2.16 pb. 



Source 


W + ljet 


W + 2 jet 


W + 3 jet 


W+ > 4 jet 


Data 


9454 


1370 


198 


54 


Non-VF 


560. 1± 14.9 


71. 2± 2.7 


12.4± 2.0 


5.1± 1.7 


WW 


31. 2± 5.4 


31. 1± 5.4 


5.2± 1.0 


0.8± 0.2 


WZ 


4.4± 0.9 


4.8± 1.0 


0.9± 0.2 


0.1± 0.0 


ZZ 


0.3± 0.1 


0.4± 0.1 


0.1± 0.0 


0.0± 0.0 


Unidentified- Z 


234. 8± 14.5 


38.5± 5.9 


7.9± 2.4 


0.7± 0.7 


Single top 


14.1± 2.1 


7.9± 1.7 


1.7± 0.4 


0.3± 0.1 


Wc 


413.1+123.9 


86.4± 25.9 


10.3± 3.2 


1.0± 0.7 


Wbb 


69.0± 9.5 


29.5± 5.1 


5.3± 1.0 


0.8± 0.5 


Wcc 


173.1± 46.2 


61.6± 13.5 


10.5± 2.5 


1.2± 0.8 


W+ jets without h.f. 


7951.0±133.5 


1022.7± 31.1 


111.6± 9.2 


10.3± 8.3 


tt 


2.9± 0.7 


15.9± 3.8 


32.1± 7.7 


33.6± 8.1 
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TABLE XXXIX. Summary of the predicted and observed number of W events with one (ST) 



or two (DT) jet-probability tags. 


Source 


W + ljet 


W + 2]et 


W + 3 jet 


W+ > 4 jet 


Mistags 


41.80 ± 4.24 


12.78 ± 1.33 


2.19 ± 0.27 


0.25 ± 0.19 


Non-W 


12.55 ± 0.95 


2.53 ± 0.61 


0.57 ± 0.33 


0.24 ± 0.14 


WW, WZ, ZZ 


1.15 ± 0.26 


2.39 ± 0.43 


0.74 ± 0.19 


0.05 ± 0.04 


Single top 


1.32 ± 0.32 


2.19 ± 0.51 


0.59 ± 0.14 


0.11 ± 0.03 


Wc 


34.80 ±10.58 


9.02 ± 2.84 


1.67 ± 0.59 


0.16 ± 0.11 


Wcc (ST) 


17.02 ± 4.60 


7.24 ± 1.73 


1.70 ± 0.45 


0.20 ± 0.14 


Wcc (DT) 




0.47 ± 0.20 


0.05 ± 0.03 


0.01 ± 0.01 


Wbb (ST) 


16.43 ± 2.32 


7.47 ± 1.52 


1.47 ± 0.35 


0.21 ± 0.14 


Wbb (DT) 




1.42 ± 0.48 


0.25 ± 0.10 


0.03 ± 0.02 


Z — ► TT 


2.35 ± 0.47 


1.13 ± 0.32 


0.17 ± 0.12 


0.09 ± 0.09 


Zc 


0.28 ± 0.09 


0.08 ± 0.03 


0.03 ± 0.01 


0.00 ± 0.00 


Zee (ST) 


0.46 ± 0.13 


0.20 ± 0.06 


0.09 ± 0.04 


0.01 ± 0.01 


Zee (DT) 




0.01 ± 0.01 


0.00 ± 0.00 


0.00 ± 0.00 


Zbb (ST) 


0.90 ± 0.14 


0.42 ± 0.10 


0.16 ± 0.06 


0.02 ± 0.02 


Zbb (DT) 




0.07 ± 0.03 


0.03 ± 0.01 


0.00 ± 0.00 


Total background (ST) 


129.08 ±12.56 


45.53 ± 4.00 


9.43 ± 0.97 


1.34 ± 0.34 


Total background (DT) 




1.97 ± 0.52 


0.33 ± 0.10 


0.04 ± 0.02 




n sn + n 1 7 






±u.u± in L.o£t 


tt (DT) 




1.10 ± 0.24 


3.90 ± 0.85 


5.46 ± 1.19 


ti + background (ST) 


129.87 ±12.56 


50.30 ± 4.14 


19.37 ± 2.38 


11.95 ± 2.35 


tt + background (DT) 




3.07 ± 0.57 


4.23 ± 0.86 


5.50 ± 1.20 


Data (ST) 


124 


62 


21 


12 


Data (DT) 




6 


5 


3 
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TABLE XL. The composition of the W+ >1 jet sample before tagging using o t i = 9.18 ± 4.26 
pb. 



Source 


W + ljet 


W + 2 jet 


W + 3 jet 


W+ > 4 jet 


Data 


9454 


1370 


198 


54 


Non-lU 


560.1± 14.9 


71.2± 2.7 


12.4± 2.0 


5.1± 1.7 


WW 


31.2± 5.4 


31.1± 5.4 


5.2± 1.0 


0.8± 0.2 


WZ 


4.4± 0.9 


4.8± 1.0 


0.9± 0.2 


0.1± 0.0 


zz 


0.3± 0.1 


0.4± 0.1 


0.1± 0.0 


0.0± 0.0 


Unidentified- Z 


234.8± 14.5 


38.5± 5.9 


7.9± 2.4 


0.7± 0.7 


Single top 


14.1± 2.1 


7.9± 1.7 


1.7± 0.4 


0.3± 0.1 


Wc 


413.1±123.9 


86.3± 25.9 


10.0± 3.2 


0.6± 1.3 


Wbb 


69.0± 9.5 


29.5± 5.1 


5.1± 1.1 


0.5± 1.0 


Wcc 


173.1± 46.2 


61.5± 13.5 


10.1± 2.6 


0.8± 1.6 


W+ jet without h.f. 


7950.6±133.5 


1020.8± 31.8 


107.8± 17.3 


6.6± 17.5 


tt 


3.3± 1.5 


18.2± 8.2 


36.7± 16.5 


38.5± 17.3 
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TABLE XLI. Summary of the predicted and observed number of W events with one (ST) or 



two (DT) SLT tags. 


Source 


W + ljet 


W + 2\et 


W + 3 jet 


W+ > 4 jet 


Mistags 


101.92 ±10.19 


30.90 ± 3.09 


7.34 ± 0.73 


3.01 ± 0.30 


Non-W 


8.96 ± 0.84 


2.09 ± 0.56 


0.38 ± 0.27 


0.16 ± 0.11 


WW, wz, zz 


0.50 ± 0.16 


0.88 ± 0.22 


0.10 ± 0.05 


0.00 ± 0.00 


Single top 


0.38 ± 0.10 


0.67 ± 0.15 


0.18 ± 0.05 


0.05 ± 0.01 


Wc 


13.12 ± 4.27 


4.26 ± 1.45 


0.65 ± 0.29 


0.04 ± 0.09 


Wcc (ST) 


6.41 ± 1.89 


2.68 ± 0.66 


0.61 ± 0.21 


0.05 ± 0.10 


Wcc (DT) 




0.02 ± 0.02 


0.00 ± 0.00 


0.00 ± 0.00 


Wbb (ST) 


5.31 ± 0.96 


2.84 ± 0.67 


0.41 ± 0.13 


0.04 ± 0.08 


Wbb (DT) 




0.09 ± 0.05 


0.01 ± 0.01 


0.00 ± 0.00 


Z — > tt 


0.43 ± 0.20 


0.09 ± 0.09 


0.09 ± 0.09 


0.00 ± 0.00 


Zc 


0.11 ± 0.04 


0.04 ± 0.01 


0.01 ± 0.01 


0.00 ± 0.00 


Zee (ST) 


0.17 ± 0.05 


0.08 ± 0.02 


0.03 ± 0.01 


0.00 ± 0.00 


Zee (DT) 




0.00 ± 0.00 


0.00 ± 0.00 


0.00 ± 0.00 


Zbb (ST) 


0.29 ± 0.06 


0.16 ± 0.04 


0.05 ± 0.02 


0.01 ± 0.01 


Zbb (DT) 




0.00 ± 0.00 


0.00 ± 0.00 


0.00 ± 0.00 


Total background (ST) 


137.60 ±11.29 


44.66 ± 3.60 


9.86 ± 0.88 


3.35 ± 0.36 


Total background (DT) 




0.10 ± 0.05 


0.01 ± 0.01 


0.00 ± 0.00 




n ot; + n 11 


9 AA -4- 1 07 


^ 1 A -4- 9 9^ 


a no _i_ o aa 
u.uo in z.uu 


tt (DT) 




0.07 ± 0.03 


0.24 ± 0.10 


0.32 ± 0.14 


tt + background (ST) 


137.85 ±11.29 


47.10 ± 3.75 


15.00 ± 2.41 


9.43 ± 2.68 


it + background (DT) 




0.17 ± 0.06 


0.25 ± 0.10 


0.32 ± 0.14 


Data (ST) 


146 


56 


17 


8 


Data (DT) 
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XVI. COMBINED TT PRODUCTION CROSS SECTION 



The best measurement of the tt cross section comes from combining the results of this 
analysis with the dilepton and all-hadronic analyses |16| , |37|] . The revised fe-tagging efficiency 



reported in this paper effects the cross section measured in the all-hadronic channel. The 



details of this analysis have not changed from those reported in Ref. ||37|| . The cross section 



measurement from the dilepton channel [16| does not require 6-tagging information and so is 
unchanged by the revisions reported here. It is affected slightly by the revised determination 
of the total integrated luminosity as were all measurements. A comparison between the 
previously published results and the revised cross sections used for the new combined result 
is shown in Table [Xbll| . 

We combine the measurements from the SVX and SLT tagged lepton + jets, all-hadronic, 
and dilepton channels, using a maximum likelihood technique similar to that described in 
Refs. |||5|. This procedure properly accounts for correlated systematic uncertainties, such 
as the uncertainties in the 6-tagging efficiency, the luminosity, the kinematic acceptance, 
and some of the calculated backgrounds. In all channels the acceptances are calculated with 
M top = 175 GeV/c 2 . The resulting combined ti production cross section is 

a t t = 6.5+};4pb 

where the quoted uncertainties include both statistical and systematic effects, which are 
approximately equal in magnitude. 

TABLE XLII. Summary of old and new CDF ti production cross section results. 
Channel Previous result New result 

Lepton + jets (SVX) 6.2±?;£ P b 5 - 1 ± L5 P b 

Lepton + jets (SLT) 9.2lf ;jj pb 9.2 ± 4.3 pb 

Dilepton 8.2±|;| pb 8.4±f;f pb 

All-hadronic 10.1±|;| pb 7.6±|;f pb 
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XVII. CONCLUSIONS 



Having improved the method for determining the 6-tagging efficiency, and the method 
for calculating the backgrounds to tt production, we revise our previous measurements of 
a t t in the lepton + jets channel [0]. We find <j t t — 5.08 ± 1.54 pb and <j t t — 9.18 ± 4.26 pb 
using events with SECVTX and SLT tags, respectively. We have used the jet-probability 
algorithm as a cross-check and find that it gives a result consistent with these measurements. 
The measurement of the tt cross section, obtained by combining the results of this analysis 
with the dilepton and all-hadronic analyses, is a t t = 6.5ti;Ipb, in agreement with the SM 
predictions M and the measurement performed by the D0 collaboration 
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